IN THE UNITED STATES PATENT AND TRADEMARK OFFICE
Certification under 37 CFR 1.10 (if applicable) 

I hereby certify that this application is being deposited with the United States Postal Service "Express 
Mail Post Office to Addressee" service Mail Label No. EM6O14575750S under 37 CFR §1.10 on the date indicated 
below and is addressed to: Assistant Commissioner for Patents, Attn: Box Patent Application, Washington, 
D.C. 20231. 

Thomas Fitting, Reg. No. 34,163 Date of Deposit 



Anticipated Classification of this Application: 

Class Subclass 

Prior Application: 

Examiner S. Priebe 

Art Unit 1805 



ASSISTANT COMMISSIONER FOR PATENTS 
Washington, D.C. 20231 
ATTN: BOX PATENT APPLICATION 

Sir: 

FILING UNDER 37 CFR 1.60 WITHOUT 
ABANDONMENT OF THE PENDING PRIOR APPLICATION 

This is a request for filing a continuation, X divisional, 

patent application under 37 CFR 1.60 of pending prior application 
Serial No. 08/286,888 filed on August 5, 1994 for LAMBDOID
BACTERIOPHAGE VECTORS FOR EXPRESSION OF FOREIGN PROTEINS and
which is not being abandoned at this time. 
Enclosed is a true copy of the above-identified prior
complete application, including the specification
(including the claim or claims), the drawings (if the
prior application included drawings), the signed oath or
declaration, and any amendments referred to in the oath
or declaration filed to complete the prior application.
I hereby verify that these enclosed papers are a true
copy of what is shown in my records to be the
above-identified prior application and that no amendments
referred to in any oath or declaration that may have
been filed to complete the prior application introduced
new matter therein.



X_ The filing fee is calculated below on the basis of the 
claims existing in the prior application and, if 
applicable, as amended at Item 5 below to the extent 
that the amendments reduce the number of claims. 



FOR                     NO. FILED    NO. EXTRA    RATE          CALCULATIONS

TOTAL CLAIMS            29-20 =      9            x $22.00 =    $ 198.00

INDEPEN. CLAIMS         7-3 =        4            x $80.00 =    160.00

MULTIPLE DEPENDENT      0                         + $260.00 =   0
CLAIM(S)
(if applicable)

BASIC FEE                                                       + $1128.00

Total of Above Calculations                                     = $1128.00

Reduction by X for filing by small entity
(Note 37 CFR 1.9, 1.27, 1.28). If applicable,
verified statement must be attached.

TOTAL                                                           = $1128.00
3. Method of Payment of Fees

a. X Enclosed is a check in the amount of $1,128.00

b. Charge Deposit Account No. 19-0962 in the amount of 

$ . A duplicate of this request is 

attached. 

4. Authorization to Charge Additional Fees 

The Commissioner is hereby authorized to charge the following
additional fees which may be required to Deposit Account No.
19-0962.

a. X 37 CFR 1.16 (filing fees and fees for presentation 

of extra claims).

b. X 37 CFR 1.17 (application processing fees). 

c. 37 CFR 1.18 (issue fee at or before mailing of 

Notice of Allowance, pursuant to 37 CFR 1.311(b)).

5. Amendment

It is understood that only amendments reducing the number of 
claims or adding a reference to the prior application under 
37 CFR 1.78(a) will be entered before calculating the filing 
fee and granting the filing date. 

a. X Amend the specification by inserting before the 
first line of the sentence: 

This is a continuation, X divisional, of 

application Serial No. 08/286,888, filed August 5,
1994, whose disclosures are incorporated herein by
reference.
b. Cancel in this application original claims

inclusive, of the prior application before
calculating the filing fee. (At least one original
independent claim must be retained for filing
purposes.)

c. A preliminary amendment is enclosed. 



6. X Xerographic drawings are a part of the enclosed

application copy identified in Item 1 above, and the 
drawings of the prior application are to be 
transferred. 

a. New formal drawings are also enclosed. 

b. New formal drawings are not enclosed at this 

time.

7. X The above-identified prior application is assigned of 

record to The Scripps Research Institute 

8. X Power of Attorney 

a. X The power of attorney in the above-identified 

prior application is to Thomas Fitting 

Reg. No. 34,163, and the other

attorneys or agents identified in the power in 
the prior application. 

i. The power of attorney appears in the

original papers in the prior application; a 
copy of which is enclosed. 

ii. X The power of attorney does not appear in 
the original papers of the prior application, 
and a copy of the power in the prior 
application is enclosed. 



b. 



A new power of attorney has been executed and 
is attached. 
9. Priority of application Serial No. filed on

is claimed under 35 U.S.C. 119. 

10. The certified copy of the priority application Serial 

No. identified in Item 9 above has been 

filed in above-identified prior application Serial No. 
filed on . 

11. Enclosed is a copy of the verified statement(s)

supporting a claim for, and/or claiming, small entity
status under 37 CFR 1.9 and 37 CFR 1.27 as filed in the
above-identified prior application under 37 CFR 1.28(a),
and such small entity status is claimed in this
application.

12. An original executed verified statement(s) supporting a

claim for, and/or claiming, small entity status under 37 
CFR 1.9 and 37 CFR 1.27 is enclosed. 

13. A petition, fee, and response has been filed to extend 

the term in the pending prior application until 



14. X Please transfer to this application, in accordance with 
37 CFR 1.821(e), the computer readable copy(ies) from 
applicant's other application Serial No. 08/286,888,
filed August 5, 1994.



All papers accompanying this submission, or for which a 
request for transfer from applicant's other application,
introduce no new matter. 
Address all future communications to:

THE SCRIPPS RESEARCH INSTITUTE 
10550 North Torrey Pines Road 
Mail Drop TPC 8 
La Jolla, California 92037 
(619) 784-2937 

The undersigned declares further that all statements 
made herein of his or her own knowledge are true and that all 
statements made on information and belief are believed to be 
true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001
of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of the application or any 
patent issuing thereon. 

Date Signature 

Thomas Fitting 

Printed Name 

X Attorney or agent of record 

Filed under Rule 34 (a) 

Registration No. of Signator: 34,163

Correspondence Address of Signator: 

THE SCRIPPS RESEARCH INSTITUTE 
10550 North Torrey Pines Road 
Mail Drop TPC 8 
La Jolla, California 92037 

Telephone Number of Signator: (619) 784-2937 
LAMBDOID BACTERIOPHAGE VECTORS
FOR EXPRESSION OF FOREIGN PROTEINS 05/06/97 



Technical Field 

The present invention relates to cloning vectors 
and methods for producing a library of DNA molecules 
capable of expressing a fusion polypeptide on the 
surface of a lambdoid phage particle. 

Background 

The expression of polypeptides fused to the
surface of filamentous bacteriophage provides a
powerful method for recovering a particular sequence
from a large ensemble of clones (Smith et al.,
Science, 228:1315-1317, 1985). Peptides binding to
avidin or antibodies have been selected from large
libraries by the relatively simple method of panning
(Scott et al., Science, 249:386-390, 1990; Devlin et
al., Science, 249:404-406, 1990; and Cwirla et al.,
Proc. Natl. Acad. Sci. U.S.A., 87:6378-6382, 1990).
Larger proteins, such as antibodies (McCafferty et
al., Nature, 348:552-554, 1990; Lowman et al.,
Biochemistry, 30:10832-10838, 1992; and Kang et al.,
Proc. Natl. Acad. Sci. U.S.A., 88:4363-4366, 1991) and
human growth hormone (Bass et al., Proteins, 8:309-314,
1990), have also been fused to surface proteins
of filamentous phage which can then be used for the
selection of particular sequences from a large number
of variants.

In these filamentous phage systems, the foreign 
protein or polypeptide is fused to the amino-terminus
of either coat protein III or coat protein VIII of the 
filamentous phage M13 and the fused protein is 
secreted through the Escherichia coli cytoplasmic 
membrane into the periplasmic space. Most of the 



- 2 - TSRI 432 . 0 

proteins successfully fused with filamentous phage
surface proteins have been secreted proteins and,
although Rebar and Pabo (Science, 263:671-673, 1994)
have recently used pIII to display three zinc fingers
of Zif268 (a DNA-binding protein), many cytoplasmic
proteins will interfere with the passage of the fusion
product from the cytoplasm to the periplasm.

Furthermore, cDNA gene products cannot be
directly expressed as fusion proteins to the
amino-terminus of the viral coat proteins due to
transcriptional stop sites present at the 3' end of
non-translated regions in eukaryotic cDNA obtained by
poly(A+) selection of mRNA followed by poly(A) priming
(Molecular Cloning: A Laboratory Manual, Second
Edition, Maniatis et al., eds., Cold Spring Harbor,
NY, 1989). These facts prevent the use of filamentous
phage systems developed thus far for cDNA library
screening.

Herein described is a lambda vector system useful 
for the expression of a foreign protein on its surface 
which should be more appropriate for proteins that 
fold in the cytoplasm. The lambda vector of this 
invention could therefore complement the filamentous 
phage system in the identification of novel 
polypeptides with a biological activity. 

Brief Summary of the Invention 

A new surface-integration technology has been 
discovered for expressing a cytoplasmic recombinant 
gene product on the surface of a lambdoid phage 
containing the recombinant gene. The invention uses a 
portion of the lambdoid phage tail protein as a means 
for linking gene-product and gene during the assembly 
stage of lambdoid phage replication. 
That is, during lambdoid phage replication, tail
proteins assemble into a matrix which forms the 
tubular tail. The tubular tail is attached to a head 
which encapsulates the phage genome. It has now been 
discovered that (1) phage assembly is not disrupted 
when recombinant lambdoid phage tail proteins are 
present, (2) recombinant lambdoid phage tail proteins 
can be integrated into the assembling matrix, and (3) 
integration into the matrix can be directed to occur 
in a surface-accessible orientation. 

The present invention can be advantageously 
applied to the production of biologically active 
multimeric polypeptides of predetermined activity, 
i.e., it can be used to produce enzymes, lectins, 
ligands, receptors, and the like that have a 
preselected activity. 

Thus, the present invention provides for linking 
the functions of biologically active monomeric or 
multimeric polypeptides and lambdoid phage replication 
in a method for isolating a biologically active 
multimeric polypeptide and the gene that encodes the 
polypeptide. The method produces a lambdoid phage 
comprised of a matrix of tail-encoded proteins that 
form a tubular tail attached to a matrix of head- 
encoded proteins that form an icosahedral head. The 
recombinant genome, encapsulated within the matrix of 
head-encoded proteins, contains genes encoding the 
biologically active multimeric polypeptide. The 
biologically active multimeric protein is surface- 
integrated into the encapsulating matrix via a portion 
of the lambdoid tail protein that is fused by a 
peptide bond during translation to a monomer of the 
multimeric polypeptide. The multimeric polypeptides 
and the genes which encode the polypeptides are thus 
physically linked during the assembly stage of the 
phage replication cycle. Specific biological activity
of the enzyme- or lectin-coated phage advantageously 
provides a means for isolating a recombinant genome 
that encodes a desired multimeric polypeptide. 

Thus in one embodiment, the invention describes a 
recombinant lambdoid bacteriophage vector comprising a 
nucleotide sequence that (i) defines the lambdoid 
elements for replication and packaging of the vector 
into an assembled bacteriophage, and (ii) encodes a 
conditionally suppressible cistron for expression of a 
tail protein and a fusion protein that comprises: 

a) a promoter for transcribing the cistron, 

b) a first upstream translatable sequence that 
encodes a lambdoid bacteriophage tail polypeptide, 

c) a first ribosome binding site to initiate 
translation of the upstream translatable sequence, 

d) a second translatable sequence operatively 
linked downstream to the first translatable sequence 
that (i) encodes a linker polypeptide in frame with 
the tail polypeptide and (ii) includes a sequence 
adapted for ligation of an insert polynucleotide that 
defines a third translatable sequence downstream from 
the second translatable sequence that encodes a 
preselected polypeptide, and 

e) a suppressor termination codon within the 
second translatable sequence that upon suppression 
results in read-through to form a fusion polypeptide 
consisting of the tail polypeptide, linker polypeptide 
and preselected polypeptide. 

The vector can optionally further include a 
nucleotide sequence that defines a second ribosome 
binding site to initiate translation of the third 
translatable sequence, thereby allowing the vector to 
also express the preselected polypeptide as a free 
protein. 
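
By way of illustration only, the arrangement of elements a) through e) and of the optional second ribosome binding site can be sketched as a small Python data structure. The class name `Cistron`, its field and method names, and the element labels are hypothetical and form no part of the disclosure. The sketch further assumes that suppression is incomplete, so that unfused tail protein is still produced in a suppressing host.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cistron:
    """Illustrative model of the conditionally suppressible cistron."""

    # Optional element: a second ribosome binding site ahead of the insert.
    has_second_rbs: bool = False

    def elements(self):
        """Return the elements in 5'-to-3' order (labels are hypothetical)."""
        parts = [
            "promoter",
            "first ribosome binding site",
            "tail polypeptide sequence",
            "linker sequence with suppressor termination codon",
        ]
        if self.has_second_rbs:
            parts.append("second ribosome binding site")
        parts.append("insert encoding preselected polypeptide")
        return parts

    def products(self, host_suppresses):
        """List the polypeptides expected in a given host.

        Assumption: the tail protein is always made, because translation
        stops at the termination codon whenever it is not suppressed.
        """
        made = ["tail protein"]
        if host_suppresses:
            made.append("tail-linker-preselected fusion")
        if self.has_second_rbs:
            made.append("free preselected polypeptide")
        return made


vector = Cistron(has_second_rbs=True)
print(vector.products(host_suppresses=True))
```

In this sketch a non-suppressing host yields no fusion polypeptide, which reflects the conditional nature of the cistron described above.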
The invention further describes a recombinant
lambdoid bacteriophage particle comprising a matrix of 
proteins encapsulating a lambdoid genome encoding a 
fusion protein, wherein the matrix includes the fusion 
protein surface accessible in the matrix, and the 
fusion protein consists essentially of, in the 
direction of amino terminus to carboxy terminus, a 
lambdoid bacteriophage tail polypeptide, a linker 
polypeptide and a preselected polypeptide. In 
preferred embodiments, the genome in the bacteriophage 
further encodes a heterologous protein capable of 
forming a multimeric protein complex with the fusion 
protein in said matrix. 

The invention further describes a fusion protein 
having an amino acid residue sequence that comprises, 
in the direction of amino terminus to carboxy 
terminus, a lambdoid bacteriophage tail polypeptide, a 
linker polypeptide and a preselected polypeptide 
defining a biological activity. 

Libraries of recombinant lambdoid bacteriophage 
particles are also described wherein each particle 
contains a recombinant lambdoid bacteriophage vector 
as described hereinabove. 

The invention describes a method for producing a
recombinant lambdoid bacteriophage of this invention,
comprising the steps of:

a) infecting an E. coli host strain having
a termination codon suppression phenotype with a
recombinant lambdoid bacteriophage vector described
above; and

b) culturing the infected host strain
under bacteriophage growth conditions to produce the
recombinant lambdoid bacteriophage. Preferably, the
host strain is a suppressor strain selected from the
group consisting of EQ166, CA168 and MC8.
Also described is a method for detecting the
presence of a preselected target in a sample
comprising the steps of:

a) admixing a sample containing the
preselected target with a recombinant lambdoid
bacteriophage described above, wherein the preselected
polypeptide defines a biologically active ligand or
receptor able to bind the preselected target, under
binding conditions sufficient for the target-binding
bacteriophage to bind the target and form a
target-ligand or receptor complex; and

b) detecting the presence of the complex,
and thereby the presence of the preselected target.

Other embodiments are apparent from the detailed
disclosures that follow.

Brief Description of the Drawings 

In the drawings forming a portion of this
disclosure:

Figure 1 illustrates a flow chart of the
construction of the λV'sac vector. Also indicated is
the partial nucleotide and amino acid sequence of the
λV'sac vector. The amino acid sequence is given in
SEQ ID NO 9. The top strand of the nucleotide
sequence from left to right is given in SEQ ID NO 10.
The bottom strand of the nucleotide sequence from left
to right is given in SEQ ID NO 11. The left arm of λ
phage is depicted with genes Nu1 through J shown as
boxes. The arrow at the top of the figure indicates
the number of kilobase pairs (kb) of the λ vector.
The DNA segment from nucleotide 5505 to 10,325 and
encoding genes Nu3 through G was removed from the λ
vector by digestion with the enzymes BamHI and NsiI.
This fragment is shown as a line with nucleotide
numbers below the topmost λ genetic map. Insertion of
the DNA fragment into the M13mp19 and pUC12 vectors,
site-directed mutagenesis to introduce the amber stop
codon and Sac I restriction site, and subsequent
combination with the λ2000 vector to create the λV'sac
vector are as described in Example 1a1). A portion of
the resulting λV'sac vector is shown at the bottom of
the figure consisting of the amino acid and nucleotide
sequence of part of the mutagenized V gene. The
nucleotide positions which were mutagenized are
underlined and a newly created Sac I site is indicated
in boldface type. The amber suppressor stop codon is
indicated by the nucleotide sequence TAG which is
underlined. Numbers at the left side from top to
bottom indicate the amino acid number of the pV gene
and the nucleotide sequence from the left arm of λ,
respectively. Abbreviations used to indicate the
corresponding restriction sites are: Ba, BamHI; Bc,
BclI; Aa, AatII; Ap, ApaI; N, NsiI; E, EcoRI; and
ApL, ApaLI.

Figure 2 illustrates partial sequence of the
λblue and λfoo vectors. The partial sequences
illustrate the amino acid and nucleotide sequence
between the amber stop codon of the pV protein,
indicated by TAG and underlined, and the downstream
restriction sites. The names of the restriction sites
are indicated below the sites and the nucleotide
sequences defining the restriction sites are
underlined. The amino acid sequence is given above
the nucleotide sequence for λblue and λfoo. The λblue
amino acid and nucleotide sequences are given as SEQ
ID NOs 14 and 15, respectively. The λfoo amino acid
and nucleotide sequences are given from amino acid
residues 178 to 237 of SEQ ID NO 6 and base number 547
to 726 of SEQ ID NO 5, respectively. The amino acid
sequence downstream of the suppressor codon is
expressed at the carboxy terminus of pV in the
presence of a suppressor tRNA as described in Example
1a2. Numbers in the right-hand column, from top to
bottom, indicate the amino acid position from the
start codon of the pV gene and the nucleotide sequence
of the λ vector, from the beginning of the left arm,
at the end of the respective lines.

The Pro-Thr linker sequence in the λfoo vector
(from base number 562 to 624 and amino acid residue
sequence from 183 to 203 of SEQ ID NO 5) is directly
downstream of the SfiI restriction site. The DNA
expression control sequences for expressing
translatable DNA inserted into any of the downstream
restriction sites are included in the linker sequence.
The Pribnow box, CAGGAA, is double underlined and is 6
nucleotides upstream of the start codon, methionine
(M). The second amino acid, threonine (T), is
underlined.

Figure 3 illustrates the detection of λ phage
proteins by gel electrophoresis and Western blotting.
Purified phage proteins were detected on a
polyacrylamide gel by staining with PAGE blue 84 (BDH)
(lanes A, B, and C on the left side) and by reactivity
with a mouse anti-β-gal antibody (lanes A, B, and C on
the right side) as described in Example 2a4). Lane A
is λfoo phage proteins when grown on MC8 (su+); lane
B is λβ-gal phage proteins when grown on EQ166 (su*);
and lane C is λβ-gal phage proteins when grown on MC8
(su+). Molecular weight standards (GIBCO/BRL)
(unlabeled lane in the center) are myosin heavy chain
(200 kDa), phosphorylase b (97.4 kDa), and bovine
serum albumin (68 kDa).
Detailed Description of the Invention
A. Definitions

Amino Acid Residue: An amino acid formed
upon chemical digestion (hydrolysis) of a polypeptide
at its peptide linkages. The amino acid residues
described herein are preferably in the "L" isomeric
form. However, residues in the "D" isomeric form can
be substituted for any L-amino acid residue, as long
as the desired functional property is retained by the
polypeptide. NH2 refers to the free amino group
present at the amino terminus of a polypeptide. COOH
refers to the free carboxy group present at the
carboxy terminus of a polypeptide. In keeping with
standard polypeptide nomenclature (described in J.
Biol. Chem., 243:3552-59 (1969) and adopted at 37 CFR
§1.822(b)(2)), abbreviations for amino acid residues
are shown in the following Table of Correspondence:



TABLE OF CORRESPONDENCE

SYMBOL                      AMINO ACID
1-Letter     3-Letter

Y            Tyr            tyrosine
G            Gly            glycine
F            Phe            phenylalanine
M            Met            methionine
A            Ala            alanine
S            Ser            serine
I            Ile            isoleucine
L            Leu            leucine
T            Thr            threonine
V            Val            valine
P            Pro            proline
K            Lys            lysine
H            His            histidine
Q            Gln            glutamine
E            Glu            glutamic acid
Z            Glx            Glu and/or Gln
W            Trp            tryptophan
R            Arg            arginine
D            Asp            aspartic acid
N            Asn            asparagine
B            Asx            Asn and/or Asp
C            Cys            cysteine
X            Xaa            Unknown or other

It should be noted that all amino acid residue
sequences represented herein by formulae have a
left-to-right orientation in the conventional direction of
amino terminus to carboxy terminus. In addition, the
phrase "amino acid residue" is broadly defined to
include the amino acids listed in the Table of
Correspondence and modified and unusual amino acids,
such as those listed in 37 CFR 1.822(b)(4), and
incorporated herein by reference. Furthermore, it
should be noted that a dash at the beginning or end of
an amino acid residue sequence indicates a peptide
bond to a further sequence of one or more amino acid
residues or a covalent bond to an amino-terminal group
such as NH2 or acetyl or to a carboxy-terminal group
such as COOH.
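
For illustration only, the Table of Correspondence can be expressed as a lookup table in Python, with a helper that rewrites a one-letter sequence in three-letter symbols from amino terminus to carboxy terminus. The names `ONE_TO_THREE` and `to_three_letter` are hypothetical, and the use of a hyphen between residues is a display choice made for this sketch.

```python
# One-letter to three-letter symbols, as listed in the Table of Correspondence.
ONE_TO_THREE = {
    "Y": "Tyr", "G": "Gly", "F": "Phe", "M": "Met", "A": "Ala",
    "S": "Ser", "I": "Ile", "L": "Leu", "T": "Thr", "V": "Val",
    "P": "Pro", "K": "Lys", "H": "His", "Q": "Gln", "E": "Glu",
    "Z": "Glx", "W": "Trp", "R": "Arg", "D": "Asp", "N": "Asn",
    "B": "Asx", "C": "Cys", "X": "Xaa",
}


def to_three_letter(sequence):
    """Convert a one-letter residue sequence to three-letter symbols.

    The input is read left to right, amino terminus first. A symbol that
    is not in the table raises KeyError rather than being guessed.
    """
    return "-".join(ONE_TO_THREE[residue] for residue in sequence.upper())


print(to_three_letter("MT"))  # Met-Thr
```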

Recombinant DNA (rDNA) molecule: A DNA molecule
produced by operatively linking two DNA segments.
Thus, a recombinant DNA molecule is a hybrid DNA
molecule comprising at least two nucleotide sequences
not normally found together in nature. rDNAs not
having a common biological origin, i.e.,
evolutionarily different, are said to be
"heterologous".



Vector: A rDNA molecule capable of autonomous
replication in a cell and to which a DNA segment,
e.g., gene or polynucleotide, can be operatively
linked so as to bring about replication of the
attached segment. Vectors capable of directing the
expression of genes encoding for one or more
polypeptides are referred to herein as "expression
vectors". Particularly important vectors allow
cloning of cDNA (complementary DNA) from mRNAs
produced using reverse transcriptase.

Fusion Polypeptide: A polypeptide comprised of
at least two polypeptides and a linking sequence to
operatively link the two polypeptides into one
continuous polypeptide. The two polypeptides linked
in a fusion polypeptide are typically derived from two
independent sources, and therefore a fusion
polypeptide comprises two linked polypeptides not
normally found linked in nature.

Upstream: In the direction opposite to the
direction of DNA transcription, and therefore going
from 5' to 3' on the non-coding strand, or 3' to 5' on
the mRNA.

Downstream: Further along a DNA sequence in the
direction of sequence transcription or read out, that
is traveling in a 3'- to 5'-direction along the
non-coding strand of the DNA or 5'- to 3'-direction
along the RNA transcript.

Cistron: Sequence of nucleotides in a DNA
molecule coding for an amino acid residue sequence and
including upstream and downstream DNA expression
control elements.

Tricistron: Sequence of nucleotides in a DNA
molecule coding for three amino acid residue sequences
and including upstream and downstream DNA expression
control elements.
Reading Frame: Particular sequence of contiguous
nucleotide triplets (codons) employed in translation. 
The reading frame depends on the location of the 
translation initiation codon. 
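
The dependence of the reading frame on the initiation codon can be illustrated with a short Python sketch. The function names `codons` and `codons_from_start` are hypothetical, and the sketch assumes a DNA coding-strand sequence in which ATG serves as the translation initiation codon.

```python
def codons(sequence, frame=0):
    """Split a sequence into complete triplets starting at offset `frame`."""
    sequence = sequence.upper()
    return [sequence[i:i + 3] for i in range(frame, len(sequence) - 2, 3)]


def codons_from_start(sequence):
    """Split into triplets beginning at the first ATG.

    Returns an empty list when no ATG is present, since no reading
    frame is then defined in this simplified model.
    """
    start = sequence.upper().find("ATG")
    if start < 0:
        return []
    return codons(sequence, frame=start)


# The same nucleotides give different triplets in different frames.
print(codons("ATGACCTAG", frame=0))  # ['ATG', 'ACC', 'TAG']
print(codons("ATGACCTAG", frame=1))  # ['TGA', 'CCT']
```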

Suppressor Sensitive Mutations: Conditionally
lethal mutations including suppressor sensitive (sus)
mutations, also known as nonsense mutations. The
mutational events leading to sus generate any of three
nonsense codons: UAG for the amber mutations, UAA for
the ochre mutations, and UGA for the opal mutations.

Suppressor Strains: Bacterial host strains
generally used when working with sus mutations are a
nonpermissive strain that does not allow the
expression of the sus mutation (su−) and a permissive
or suppressor strain that does allow the expression of
the sus mutation (su+). These suppressor strains
usually contain a mutation in a tRNA gene so that a
new species of tRNA is produced which is able to give
sense to an otherwise nonsense codon, in other words,
to insert an amino acid at what would otherwise be a
protein chain termination codon.
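
The effect of a suppressor tRNA on an amber codon can be sketched in Python as follows. The function name `translate` and the parameter `suppressor_aa` are hypothetical. The sketch reads DNA coding-strand codons (TAG corresponding to UAG in the mRNA) and assumes complete suppression of every amber codon, whereas suppression in a living host is generally partial. The amino acid inserted depends on the particular suppressor tRNA; glutamine (Q) is used here only as an example.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, suppressor_aa=None):
    """Translate from the first base, stopping at a termination codon.

    If `suppressor_aa` is given, the amber codon TAG is read as that
    amino acid instead of terminating (a su+ host in this model).
    Ochre (TAA) and opal (TGA) codons always terminate.
    """
    protein = []
    dna = dna.upper()
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            if codon == "TAG" and suppressor_aa is not None:
                protein.append(suppressor_aa)  # read-through
                continue
            break
        protein.append(residue)
    return "".join(protein)


gene = "ATGACCTAGGGTTAA"  # ATG ACC TAG GGT TAA
print(translate(gene))                     # nonpermissive host
print(translate(gene, suppressor_aa="Q"))  # suppressor host
```

In the nonpermissive case translation ends at the amber codon, while in the suppressor case it continues until the next termination codon, which is the read-through behavior on which the fusion polypeptide depends.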

Homomeric: A complex comprised of at least one
or more identical polypeptides.

Heteromeric: A complex comprised of at least two
or more non-identical polypeptides.

Monomeric: A protein comprised of a single
polypeptide.

Dimeric: A complex comprised of two identical or
nonidentical polypeptides.

Tetrameric: A complex comprised of four
identical or nonidentical polypeptides.

Multimer: A complex comprised of at least two or
more polypeptides. The multimer may be homomeric or
heteromeric. The multimer may be dimeric or
tetrameric. The multimer may thus comprise homomeric,
homodimeric, homotetrameric, heterodimeric, or
heterotetrameric polypeptides.
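
The terms defined above can be combined mechanically, as the following Python sketch suggests. The function name `classify` and the subunit labels are hypothetical. The sketch treats a single polypeptide as monomeric and labels complexes of sizes other than two or four simply as homomeric or heteromeric.

```python
def classify(subunits):
    """Name a complex from a list of subunit labels.

    Identical labels denote identical polypeptides. Only the dimeric
    and tetrameric sizes receive a specific name in this sketch.
    """
    count = len(subunits)
    if count == 0:
        raise ValueError("a complex needs at least one polypeptide")
    if count == 1:
        return "monomeric"
    kind = "homo" if len(set(subunits)) == 1 else "hetero"
    size = {2: "dimeric", 4: "tetrameric"}.get(count, "meric")
    return kind + size


print(classify(["A", "A", "A", "A"]))  # homotetrameric
print(classify(["A", "B"]))            # heterodimeric
```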

Cis: Cis refers to when the phage genome
contains a second cistron for the expression of
heterosubunits, (1) other than the fusion protein
subunit or (2) supplementing the amounts of fusion
protein subunit; and for supplemental expression of
the homomeric subunit in soluble form.

Trans: Trans refers to when an independent
genome contains a second cistron for the expression of
heterosubunits, (1) other than the fusion protein
subunit or (2) supplementing the amounts of fusion
protein subunit; and for supplemental expression of
the homomeric subunit in soluble form. The
independent genomes comprise episome, plasmid, or
helper phage genomes, or the second cistron may be
integrated into the host genome.

B. Lambda Bacteriophage Displaying Fusion
Proteins

The present invention contemplates a
lambdoid bacteriophage (phage) comprising a matrix of
proteins encapsulating a genome encoding a fusion
protein capable of forming a biologically active
multimeric polypeptide. The phage further contains a
biologically active multimeric protein comprised of
monomeric polypeptides surface-integrated into the
matrix via a portion of a lambdoid phage tail protein
fused to at least one monomer of the biologically
active multimeric protein. The biologically active
multimeric protein has the capacity to have catalytic
activity and therefore is referred to as an
enzymatically active complex or enzyme.
Alternatively, the biologically active multimeric
protein has the capacity to bind a ligand or a
receptor and therefore can be referred to
alternatively as a ligand-binding complex or receptor,
or a ligand.

The biologically active multimeric protein in a
preferred embodiment is an enzymatically active
complex, that is, a complex of polypeptides capable
of enzymatic activity. For example, the polypeptides
are β-gal monomeric polypeptides.

Alternatively, the biologically active multimeric 
protein in a preferred embodiment is a ligand-binding 
complex. That is, a complex of polypeptides capable 
of binding a ligand. For example, the polypeptides 
are BPA monomeric polypeptides. 

The polypeptides are capable of autogenous 
assembly into a biologically active complex, which is
then expressed on the outer surface of the lambdoid 
phage particle in a manner accessible to substrate, 
i.e., they are surface-integrated into the phage. 
Thus, a biologically active complex is typically 
present on the surface of a phage of this invention. 
Typically, the biologically active complex is 
comprised of (i) a fusion polypeptide which comprises 
in the direction of amino terminus to carboxy 
terminus, a lambdoid bacteriophage tail (matrix 
anchor) polypeptide, a linker polypeptide, and a 
preselected polypeptide defining a subunit of the 
biologically active complex; and (ii) one or more 
wild-type monomeric polypeptides of the biologically 
active complex. Preferred are phage having a portion 
of the pV tail matrix anchor domain fused to a 
monomeric polypeptide of the multimeric complex as 
described further herein. 

Alternatively, the polypeptides are capable of
autogenous assembly into a multimeric complex, which
is then expressed on the outer surface of the lambdoid
phage particle in a manner accessible to ligand, for
example in the case of a ligand-binding complex, i.e.,
they are surface-integrated into the phage. Thus, a
multimeric complex of this invention is present on the
surface of a phage of this invention. Typically, the
ligand-binding complex is comprised of (i) a fusion
polypeptide which comprises a lambdoid phage tail
matrix anchor, a linker polypeptide, and a monomer of
the ligand-binding complex; and (ii) one or more
wild-type monomers of the ligand-binding complex.

Because the enzymatically active complex is
linked to the phage in a surface accessible manner,
the phage can be advantageously used as a solid-phase
catalyst. In preferred embodiments, the phage are
linked, preferably removably linked, to a solid
(aqueous insoluble) matrix such as agarose, cellulose,
synthetic resins, polysaccharides and the like. For
example, the phage can be applied to and retained in a
column and maintained under conditions that support
retention of the phage. An aqueous composition
containing a substrate that is catalyzed by the
enzymatically active complex expressed by the phage is
then passed through the column at a predetermined rate
and under substrate-enzyme binding conditions to form
a solid-phase substrate-enzyme complex. The column is
then maintained under conditions favorable for the
catalytic conversion of substrate to product or
products. The product or products can then be removed
and recovered by washing the column with a buffer that
promotes elution of the product or products.

Alternatively, purified phage can be admixed with 
an aqueous solution containing the substrate to be 
catalyzed. The enzyme/substrate binding reaction 
admixture thus formed is maintained for a time period 
and under binding conditions sufficient for a 
phage-linked enzyme-substrate complex to form and under 
catalytic conditions sufficient for catalysis to 
occur. The phage-bound enzyme (enzyme-bearing phage) 
are then separated and recovered from the product or 
products, such as by centrifugation, electrophoresis, 
precipitation, and the like. 

Alternatively, because the ligand-binding complex 
or receptor is linked to the phage in a surface 
accessible manner, the phage can be advantageously 
used as a solid-phase affinity sorbent. In preferred 
embodiments, the phage are linked, preferably 
removably linked, to a solid (aqueous insoluble) 
matrix such as agarose, cellulose, synthetic resins, 
polysaccharides and the like. For example, the phage 
can be applied to and retained in a column and 
maintained under conditions that support retention of 
the phage. An aqueous composition containing a ligand 
that binds to the receptor expressed by the phage is 
then passed through the column at a predetermined rate 
and under ligand-binding conditions to form a 
solid-phase receptor-ligand complex. The column is then 
washed to remove unbound material, leaving the ligand 
bound to the solid-phase phage. The ligand can then 
be removed and recovered by washing the column with a 
buffer that promotes dissociation of the 
receptor-ligand complex. 

Alternatively, purified phage can be admixed with 
an aqueous solution containing the ligand to be 
affinity purified. The receptor/ligand binding 
reaction admixture thus formed is maintained for a 
time period and under binding conditions sufficient 
for a phage-linked receptor-ligand complex to form. 
The phage-bound ligand (ligand-bearing phage) are then 
separated and recovered from the unbound materials, 
such as by centrifugation, electrophoresis, 
precipitation, and the like. 

Phage of this invention can be labeled when used 
in a diagnostic method of this invention. Preferred 
labels include radioactively labeled nucleic acids 
incorporated into the phage genome, or radioactively 
labeled amino acids incorporated into protein 
components of the phage particle. Labeled phage can 
be routinely prepared by growing 
phage as described herein, but including radiolabeled 
nucleotides or radiolabeled amino acids in the culture 
medium for incorporation into nucleic acids or 
polypeptides of the phage, respectively. Exemplary 
labels are 3H-thymidine or 35S-methionine. Other 
isotopic labels and other nucleotide or amino acid 
precursors are readily available to one skilled in the 
art. The labeled phage preferably contains sufficient 
label to be detectable in a binding assay of this 
invention, i.e., the phage is detectably labeled. 

1. Lambdoid Bacteriophage Classification 
and Structure 

Lambdoid bacteriophages are a group of 
related viruses that infect bacteria. They are termed 
lambdoid, or "lambda-like", because the first member 
to be described was lambda (λ). The lambdoid 
bacteriophage (phage) are members of a group which 
have three common properties: the ability to 
recombine when intercrossed, DNA molecules possessing 
identical pairs of cohesive ends, and prophages that 
are inducible by ultraviolet irradiation. The best 
known members of the lambdoid group comprise λ, 21, 
φ80, φ81, 82, 424, and 434. 

2. Fusion Display Protein 

Lambdoid phage particles are about half 
protein and half DNA. The mature capsule of lambdoid 
phage is comprised of a morphologically distinct head 
and tail. Each phage particle contains one double- 
stranded DNA molecule capable of replication that is 
encapsulated in an icosahedral head from which 
projects a tubular tail. 

The bacteriophage head and tail are assembled 
separately and then together to form a phage particle 
capable of infecting and replicating in a suitable 
host. The bacteriophage head encapsulates the phage 
genome. The steps of bacteriophage λ head and tail 
morphogenesis are complex and proceed in an obligate 
order. Details of head and tail morphogenesis are 
described fully in: "Lambda II", R. W. Hendrix, J. W. 
Roberts, F. W. Stahl, and R. A. Weisberg, eds., Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New 
York, pp. 279-304 and 331-363 (1983), respectively. 

The finished lambdoid head contains amino acid 
residue sequences encoded by six of the twelve genes 
required for its assembly. The assembled lambdoid 
head thus contains gene products of six genes (W, B, 
C, D, E, and FII) and eight types of polypeptides (pE, 
pD, pB, pW, pFII, pB* [pB* refers to a cleavage product 
of pB], pX1, and pX2). The mature head of λ phage 
contains 420 molecules of each of two major proteins, 
pD and pE (Imber et al., J. Mol. Biol., 139:277, 
1980). The mature head forms an icosahedral shape. 
The inner portion of 
the icosahedral head is a shell composed of only pE. 
pD occupies parts of the valleys on the surface of the 
pE shell (Georgopoulos et al., J. Mol. Biol., 76:45, 
1983). The remaining proteins are a part of the 
proximal vertex, or "head-tail" connector, located at 
the 12 vertices of the head. Thus, eight types of 
polypeptides are assembled on the surface of the 
lambda head and provide the means for display of a 
foreign polypeptide in a surface-accessible manner on 
the surface of the lambdoid phage particle. 

Twelve tail genes have been reported: Z, U, V, 
G, T, H, "208", M, L, K, I, and J (Campbell et al., 
Virology, 14:22, 1961; Thomas et al., Mutat. Res., 
4:735, 1967; Mount et al., Virology, 35:134, 1968; 
Parkinson, Genetics, 59:311, 1968). Most of the tail 
gene products are found in assembled phage particles. 

The lambdoid phage tail is a thin flexible tube, 
ending in a small conical part, and a single tail 
fiber (Kellenberger et al., in: "The Bacteriophage 
Lambda", A. D. Hershey, ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, p. 271, 
1971; Hohn et al., Curr. Top. Microbiol. Immunol., 
78:69, 1977). Both the conical part and the tail tube 
have cross striations in electron micrographs and 
consist of 3 or 4 disks and 32 disks, respectively. 
The tail tube can be dissociated into ringlike 
structures by chemical treatments. Each ringlike 
structure corresponds to single- or multi-layered 
disks (Bleviss et al., Can. J. Microbiol., 17:947, 
1971; Katsura et al., J. Supramol. Struct., 2:239, 
1974). An end view shows that there are probably six 
small knobs arranged around the core. The disk is 
composed of six subunits of pV (Buchwald et al., 
Virology, 42:390, 1970; Casjens et al., J. Mol. 
Biol., 88:535, 1974), each subunit of pV consisting of 
two separate folding domains. 
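
As a rough consistency check on the figures cited above, the 
copy number of pV in the tail tube follows from the disk 
count and the number of subunits per disk. The sketch below 
is illustrative arithmetic only; the constant and function 
names are assumptions introduced here and are not terms of 
this specification. 

```python
# Illustrative arithmetic only; all names here are hypothetical.
TAIL_TUBE_DISKS = 32      # disks in the tail tube, as stated in the text
PV_SUBUNITS_PER_DISK = 6  # each disk is composed of six subunits of pV


def pv_copies_per_tail(disks: int = TAIL_TUBE_DISKS,
                       subunits_per_disk: int = PV_SUBUNITS_PER_DISK) -> int:
    """Return the number of pV subunits in a tail tube of stacked disks."""
    return disks * subunits_per_disk


if __name__ == "__main__":
    # 32 disks x 6 subunits per disk
    print(pv_copies_per_tail())
```

The product, 192, agrees with the copy number of about 192 
stated elsewhere herein for pV on the phage tail. 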

Most of the tail-gene products are found in 
finished phage particles and their locations in the 
structure are known with varying degrees of certainty. 
pJ is the part of the tail that interacts directly 
with the surface of the host cell, as judged by 
serological (Dove, J. Mol. Biol., 19:187, 1966; 
Buchwald et al., Virology, 38:1, 1969) and genetic 
(Mount et al., Virology, 35:134, 1968) evidence. 
Thus, pJ is thought to be the tail fiber. The 
major tail protein, pV, forms the tail tube which 
comprises a stack of 32 hexameric rings (Casjens et 
al., J. Mol. Biol., 90:20, 1974; Katsura et al., 
Virology, 76:129, 1977). A hexamer of pU is attached 
to the head-proximal end of the tail tube and connects 
the tail to the head. pU and pZ are believed to be 
inside the tubular tail. pG, pM, pL, and pT are a 
part of the tail tip which forms the conical portion 
plus the tail fiber. Thus, pJ, pV, pG, pM, and pT are 
assembled on the surface of the lambda tail and 
provide a preferred means for display of a foreign 
polypeptide in a surface-accessible manner on the 
surface of the lambdoid phage particle. 

The matrix anchor domain of a lambdoid phage tail 
protein is the amino terminal region of the tail 
protein and is also known as the core domain. The 
core domain and the protruding domain of the pV 
polypeptide seem to be folded into more or less 
independent domains. Of the two domains, the core 
domain is responsible for the maintenance of the total 
shape of the tail and for the injection of phage DNA 
into the host cell. Therefore, the core domain is 
responsible for the assembly and function of the 
infectious phage particles (Katsura, J. Mol. Biol., 
146:493-512, 1981). Preferred tail proteins are found 
in the tail proteins encoded by pV. Mutants of pV 
have been isolated in which as much as one third of 
the pV molecule is absent from the carboxy terminal 
end, yet the tails function normally. In these 
mutants, the domain of pV that constitutes the outer 
knob is missing (Katsura, J. Mol. Biol., 146:493-512, 
1981). 

The amino acid residue sequence of a preferred 
matrix anchor domain is derived from the λ phage gene 
V tail protein (also designated pV) and has an amino 
acid residue sequence shown in SEQ ID NO 1 from 
residue 1 to residue 246. Exemplary matrix anchors 
would consist of at least residues 1 to 177, or 
residues 1 to 230 of pV. Gene V tail protein is 
present on a mature phage along the phage tail with 
typically about 180 to 200 copies of the tail protein. 

A phage particle of this invention contains at 
least one fusion protein of this invention on the 
surface of the phage particle. The actual amount of 
fusion protein present on the surface of a phage 
particle depends, in part, on the choice of protein 
matrix anchor present in the fusion protein. Where 
the anchor is derived from pD, there is the potential 
for hundreds of fusion proteins depending on growth 
conditions and other factors. Where the anchor is 
derived from the more preferred pV, there are 
typically 1 to 4 fusion proteins on the particle 
surface depending on the growth conditions and other 
factors as discussed herein. Preferably, a phage 
particle contains from about 1 to about 4 pV-derived 
fusion proteins on the surface of each particle, and 
more preferably about 1 to 2 fusion proteins per 
particle. Exemplary amounts of surface fusion protein 
are shown by the electron micrographs described in 
Example 2a6) that describe particles having about 1 to 
3 pV-derived fusion proteins per particle. The 
remaining pV tail proteins are either wild-type pV 
tail proteins or the pV amino terminal domain tail 
proteins. 

The assembly of the λ tail during phage 
morphogenesis is regulated by variation in the 
relative amount of each tail protein synthesized, the 
timing of the production of each tail protein, and the 
regulation of protein polymerization both by 
initiation and by termination. 

For further detailed descriptions of the 
structure of lambdoid phage particles, their coat and 
tail proteins and particle assembly, see Georgopoulos 
et al. and Katsura, in: "Lambda II", R. W. Hendrix, J. 
W. Roberts, F. W. Stahl, and R. A. Weisberg, eds., Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New 
York, pp. 279-304 and 331-363, 1983, respectively. 

a. Phage Display Proteins 

The protein matrix anchor provides 
a means for display of a foreign protein on the 
surface of the phage particle of this invention in a 
surface accessible manner. The protein matrix anchor 
may be selected from the group consisting of any of 
the proteins displayed on the surface of the phage 
particle and comprise the head proteins pE, pD, pB, 
pW, pFII, pB* [pB* refers to a cleavage product of pB], 
pX1, and pX2 and the tail proteins pJ, pV, pG, pM, and pT. 

The gene product of pV is the major tail protein 
and forms the tail tube (Buchwald et al., Virology, 
42:390, 1970) as described herein above. The gene 
product of pV is present on the surface of the phage 
tail at about 192 copies. pV contains 246 amino acid 
residues (SEQ ID NO 1). 

A member of the lambdoid phage, phage lambda, has 
been described having a mutation in the pV in which as 
much as one third of the pV protein is absent from the 
carboxy terminal end, yet the tails appear to function 
normally. The domain of pV that constitutes the outer 
knob is missing in these mutants (Katsura, J. Mol. 
Biol., 146:493-512, 1981). The mutant, or truncated, 
pV comprises a protein that is from about amino acid 
residue sequence 1 to 176 (SEQ ID NO 1 from positions 
1 to 176). In preferred embodiments, the pV 
polypeptides will generally have fewer than 240 amino 
acid residues, more usually fewer than about 230 amino 
acid residues, while normally having greater than 150 
amino acid residues, usually greater than about 170 
amino acid residues, more usually greater than about 
180 amino acid residues. Preferably, the pV will be 
from about 185 to about 220 amino acid residues in 
length. pV polypeptides of sufficient length to form 
a core domain capable of assembling and forming 
infectious phage particles are preferred. pV 
polypeptides of sufficient length and/or structure to 
display the fusion protein, which has been 
incorporated into the phage tail matrix, in a surface 
accessible manner are also preferred. 
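
The nested length ranges recited above for pV polypeptides 
can be summarized by the following minimal sketch. It is 
offered for illustration only: the function names and tier 
labels are hypothetical, and the limits qualified by "about" 
in the text are treated here as exact cut-offs, which is a 
simplifying assumption. 

```python
# Hypothetical helpers; "about" limits are treated as exact cut-offs.

def lower_bound_tier(n_residues: int) -> str:
    """Classify a pV length against the stated lower limits (150, 170, 180)."""
    if n_residues > 180:
        return "more usual"
    if n_residues > 170:
        return "usual"
    if n_residues > 150:
        return "normal"
    return "below stated range"


def upper_bound_tier(n_residues: int) -> str:
    """Classify a pV length against the stated upper limits (230, 240)."""
    if n_residues < 230:
        return "more usual"
    if n_residues < 240:
        return "general"
    return "above stated range"


def is_preferred_length(n_residues: int) -> bool:
    """True when the length falls in the preferred 185 to 220 residue range."""
    return 185 <= n_residues <= 220


if __name__ == "__main__":
    # 176: truncated mutant pV; 200: mid-range; 246: full-length pV.
    for length in (176, 200, 246):
        print(length, lower_bound_tier(length),
              upper_bound_tier(length), is_preferred_length(length))
```

Under these assumptions, the 176-residue truncated pV falls 
within the stated limits but outside the preferred range, 
while full-length pV (246 residues) exceeds the stated upper 
limit. 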

The truncated pV proteins in the mutant λ 
described herein were synthesized and assembled onto 
phage particles in amounts of about 192 copies per 
particle, a density at which wild-type pV is usually 
found. 

The ratio of wild-type to truncated pV protein, 
truncated pV protein to fusion protein, and wild-type 
pV protein incorporated into the phage tail matrix may 
be critical to the formation of functional (i.e., 
infectious) phage particles. This ratio can be varied 
by several means for expressing the polypeptides, such 
as by the use of transcriptional and translational 
regulators, varying the copy number of the genes, and 
the like, as is well known by those of skill in the art. 

Preferred ratios of fusion protein to truncated 
pV protein are those which result in the formation of 
functional phage particles. Preferred ratios of 
fusion protein to truncated pV protein are about 1 to 
10 fusion proteins to 200 truncated pV protein, more 
preferred ratios are about 1 to 5 to 200, and most 
preferred ratios are about 1 to 192. 

b. Displayed Proteins/Epitopes 

A fusion protein in a 
bacteriophage of this invention typically displays a 
preselected polypeptide which defines a cytoplasmic 
protein or a biologically active portion of the 
cytoplasmic protein. Cytoplasmic proteins are those 
proteins which fold and assemble in the cytoplasm. 
Cytoplasmic proteins carry out a particular function 
which may be structural, enzymatic, or regulatory. 
Due to the broad applicability of the invention, the 
species of cytoplasmic protein to be displayed need 
not be limited. Preferred biological activities 
include enzymes, ligands, receptors, co-factors, 
structural proteins, and the like. 

The replication and assembly of lambdoid phage 
occur in the cytoplasm; therefore, a fusion protein 
consisting of a phage tail polypeptide and a 
cytoplasmic protein would be assembled into the 
lambdoid phage matrix in the cytoplasm and does not 
require that the fusion protein be secreted prior to 
assembly. 

The displayed polypeptide of this invention may 
be the complete naturally-occurring polypeptide or a 
portion of the naturally-occurring polypeptide. If a 
portion of the polypeptide is displayed, the portion 
that confers the desired biological activity or 
function would be displayed. The portion of the 
polypeptide may define a structural or functional 
domain. 

The displayed polypeptide of this invention may 
be a monomeric or multimeric protein, that is, the 
displayed polypeptide may comprise one or more 
subunits. The displayed multimeric protein can be 
homomeric or heteromeric, that is, the displayed 
multimeric protein may comprise more than one 
identical or nonidentical subunits. 

An exemplary enzymatic displayed polypeptide of 
this invention is β-galactosidase (β-gal). β-gal is a 
tetramer of four identical monomers and is therefore a 
homotetramer. The tetramer of β-gal has a molecular 
mass of 465 kDa. β-galactosidase activity is 
associated with the tetrameric structure. 
β-galactosidase activity is not associated with either 
dimers or monomers of β-gal. The gene for β-gal has 
been cloned and sequenced, and is well known and 
readily available to the public. The sequence has 
been published, and is also available on public 
databases such as GenBank. β-gal cleaves a 
β-galactoside into its component sugars and is most 
commonly used to cleave lactose to form glucose and 
galactose. β-gal has been well characterized and 
further descriptions and details can be found in: 
"The Operon", J. H. Miller and W. S. Reznikoff, eds., 
Cold Spring Harbor Laboratory, pp. 89-121 (1980). It 
is also widely used commercially due to its ability to 
function as a sensitive indicator in recombinant DNA 
vectors. The nucleotide sequence encoding a monomer 
of β-gal was inserted into the DNA expression vector 
of this invention and is described in Example 2. The 
first amino acid residue of the β-gal monomer is the 
most preferred site of fusion to the pV protein; 
however, other sites of fusion may also be used, so 
long as the fusion protein has enzymatic activity. 

Another exemplary displayed polypeptide of this 
invention is Bauhinia purpurea lectin (BPA). BPA is a 
tetramer of four identical subunits and is therefore a 
homotetramer. The tetramer of BPA has a molecular 
mass of 120 kDa. BPA has a low association constant 
of 10^5 M^-1 with mucin (Osawa et al., Ann. Rev. 
Biochem., 56:21-42, 1987). 

BPA is a plant lectin that can be purified from 
B. purpurea seeds (Irimura et al., Arch. Biochem. 
Biophys., 151:475-482, 1972). Lectins are proteins 
with carbohydrate-binding sites which have the ability 
to agglutinate erythrocytes. BPA exhibits a high 
degree of specificity for N-acetylgalactosamine and 
galactose and has been shown to bind specifically to 
β-D-galactose residues, especially to 
Galβ1-3-N-acetylgalactosamine. Lactose and galactose 
have been 
shown to competitively inhibit binding of BPA to mucin 
while maltose and glucose have not. 

The cDNA for BPA was recently isolated from a 
cDNA library of germinated B. purpurea seeds (Kusui et 
al., J. Biochem., 109:899-903, 1991). The cloned BPA 
cDNA comprised 1,152 nucleotides and the open reading 
frame of the cDNA encodes a polypeptide of 290 amino 
acid residues including a signal peptide composed of 
28 amino acid residues. The cDNA sequence encoding the 
amino acid residue sequence of the mature lectin 
protein without the signal peptide was inserted into 
the DNA expression vector of this invention as 
described in Example 3. The first amino acid residue 
of the mature BPA monomer without the signal peptide 
is the most preferred site of fusion to the pV 
protein; however, other fusion sites may also be used, 
so long as the fusion protein has lectin binding 
activity. 

c. Linker Polypeptides 

The invention uses a linker 
polypeptide as a means for linking the upstream and 
downstream polypeptides, i.e., the phage matrix anchor 
(tail) and the displayed (biologically active) 
polypeptides. The linker polypeptide provides a 
polypeptide which facilitates the independent folding 
and spatial separation of the upstream and downstream 
polypeptides. In addition, the invention uses a 
linker polypeptide as a means for the physical 
separation of the upstream and downstream 
polypeptides. The linker polypeptide can optionally 
provide a proteolytic cleavage site capable of being 
cleaved by a protease and thereby allowing a means for 
physically separating the upstream and downstream 
polypeptides. The protease cleaves the peptide bonds 
at a specific site in the linker polypeptide, thereby 
eliminating the physical linkage of the upstream and 
downstream polypeptides. Thus, cleavage of the linker 
polypeptide at a specific site provides a means for 
the physical separation of the upstream and downstream 
polypeptides. 

The linker peptide of the present invention 
preferably consists of an amino acid residue sequence of 
sufficient length or size to confer a spatial 
separation between the upstream and downstream 
polypeptides and allow efficient incorporation of the 
upstream and downstream polypeptides into the lambdoid 
phage tail matrix. The linker peptide is typically 
about 10 to 100 amino acid residues in length. 
Preferably, the linker peptide is 15 to 50 amino acid 
residues. More preferably, the linker peptide is 20 
to 40 amino acid residues. An exemplary linker having 
36 amino acid residues is described in the Examples 
and has a sequence from amino acid residues 178 to 213 
of SEQ ID NO 6. 
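
The linker length ranges recited above, and the length of the 
exemplary linker spanning residues 178 to 213, can be checked 
with the following minimal sketch. The function names and 
tier labels are hypothetical and are introduced here only for 
illustration; "about 10 to 100" is treated as an exact range 
as a simplifying assumption. 

```python
# Hypothetical helpers for illustration only.

def span_length(first_residue: int, last_residue: int) -> int:
    """Number of residues in an inclusive, 1-based residue span."""
    return last_residue - first_residue + 1


def linker_length_tier(n_residues: int) -> str:
    """Return the narrowest stated range that contains the linker length."""
    if 20 <= n_residues <= 40:
        return "more preferred"
    if 15 <= n_residues <= 50:
        return "preferred"
    if 10 <= n_residues <= 100:
        return "typical"
    return "outside stated range"


if __name__ == "__main__":
    exemplary = span_length(178, 213)  # residues 178 to 213 of SEQ ID NO 6
    print(exemplary, linker_length_tier(exemplary))
```

Residues 178 to 213 inclusive give 36 residues, which is 
consistent with the stated length of the exemplary linker and 
falls within the narrowest (20 to 40 residue) range. 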

In addition, the amino acid residue sequence of 
the linker polypeptide confers a flexibility upon the 
linker polypeptide. The flexibility contributes to 
the independent folding and spatial separation of the 
upstream and downstream polypeptides and efficient 
incorporation of the upstream and downstream 
polypeptides into the lambdoid phage tail matrix. The 
flexibility conferred by the linker polypeptide is an 
inherent property of the amino acid residue sequence 
comprising the linker polypeptide. 

Preferably, the linker polypeptide consists of an 
amino acid residue sequence that is selected from the 
group comprising the hinge region of an immunoglobulin 
molecule, the Pro-Thr sequence of C. fimi 
endoglucanase and exoglucanase, and alternating 
Pro-Ser amino acid residues. Most preferably, the linker 
polypeptide of the present invention consists of 
alternating prolyl and threonyl amino acid 
residues. Such alternating prolyl and threonyl amino 
acid residues can be found in the Pro-Thr 
sequence of Cellulomonas fimi endo-β-1,4-glucanase 
(Wong et al., Gene, 44:315-324, 1986). In the 
endo-β-1,4-glucanase molecule, the Pro-Thr amino acid 
residue sequence links two domains, the 
cellulose-binding domain and catalytic domain, in C. fimi 
exoglucanase (Cex) and endoglucanase (CenA), and is 
similar in structure to the hinge region of IgA1 
immunoglobulins (Ong et al., Biotechnology, 7:604-607, 
1989). The linker polypeptide thus provides a means 
for the spatial separation of the upstream polypeptide 
from the downstream polypeptide, the individual 
folding of the upstream and downstream polypeptides, 
and efficient incorporation of the fusion polypeptide 
into the lambdoid phage tail matrix. 

The Pro-Thr linker polypeptide also provides a 
means for the 
physical separation of the upstream and downstream 
polypeptides by cleavage of the peptide bond at a 
specific site by a protease. Preferable proteases for 
use in this invention include C. fimi protease (Gilkes 
et al., J. Biol. Chem., 263:10401-10407, 1988) and 
collagenase. 

C. Infectious Phage Particle Structure 

A lambdoid phage particle of this invention 
is produced by standard lambdoid phage particle 
preparation methods and depends on the presence in a 
DNA expression vector of this invention of a lambdoid 
phage tail protein pV as described herein to provide 
the polypeptides necessary for (1) production of an 
assembled lambdoid phage tail and (2) the ability of 
the assembled lambdoid phage tail to bind to and 
infect suitable host cells. Such phage tail proteins 
can be assembled to form an infectious phage particle 
when present in a bacterial cell host upon 
introduction of genetic complementation to provide the 
phage tail proteins required for production of 
infectious phage particles. Such tail proteins can be 
supplied either in cis or in trans from in vivo or in 
vitro sources (see, for example, Katsura, in: "Lambda 
II", R. W. Hendrix, J. W. Roberts, F. W. Stahl, and R. 
A. Weisberg, eds., Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York, pp. 279-304, 1983). 

A typical and preferred method for genetic 
complementation is to infect a bacterial host cell 
containing a DNA expression vector of this invention 
with a helper lambdoid phage, plasmid, or phagemid 
thereby providing the genetic elements required for 
phage particle assembly. Genetic complementation 
would thus provide the ratio of fusion protein to tail 
protein required to form a functional lambdoid tail 
protein as described herein. Alternatively, genetic 
complementation can provide the fusion protein of this 
invention to be incorporated into the phage tail 
matrix to form a functional phage tail protein which 
displays the multimeric complex in a surface 
accessible manner. 

Lambda phage which have a mutation in the pV have 
been described in which as much as one third of the pV 
molecule is absent from the carboxy terminal end as 
described herein. These lambda phage tails assemble 
and function normally. The truncated pV proteins were 
synthesized and assembled onto phage particles in 
amounts of about 192 copies per particle, a density at 
which wild-type pV is usually found. These expressed 
truncated pV proteins, which include the amino acid 
residue sequences from about 1 to 176, do not suggest 
that a combination of wild-type and truncated pV 
proteins can be assembled to form a functional phage 
particle. The ratio of wild-type and truncated pV 
proteins required for the formation of functional 
phage particles is also described herein. It is 
therefore unexpected that a functional phage particle 
can be assembled with a combination of wild-type and 
truncated pV proteins or a combination of truncated pV 
and fusion proteins. 

D. Bacteriophage Genome 

The genome of the lambdoid bacteriophage 
contains all the elements necessary for replication, 
packaging of the genome, and a minimum number of 
proteins able to assemble into an infectious phage 
particle. 
The genome of a lambdoid bacteriophage may be divided 
into three regions. The left-hand region includes all 
of the genes (from A to J) whose products are 
necessary to package phage DNA and produce an 
infectious phage particle. The central region 
(between genes J and N) does not contain genes whose 
products are necessary for the production of 
infectious phage particles. The remaining portion of 
the genome, from N rightward, includes all the major 
control elements, the genes necessary for phage 
replication (O and P), and those for cell lysis (S and 
R). The lambdoid phage replication pathway produces a 
linear polymer of the lambdoid bacteriophage genome 
which can be replicated and packaged into a λ head. 
The λ head and tail assemble to form an infectious 
phage particle capable of infecting and replicating in 
a suitable host. 

In addition, the bacteriophage genome of this 
invention encodes a cistron that expresses the 
biologically active multimeric complex of this 
invention in a surface accessible manner on the 
surface of phage particles. The cistron is adapted 
for the insertion of insert DNA and is described 
further herein. The expression of the biologically 
active multimeric complex is under a type of 
translational control known as conditional suppression 
and is further described herein. Conditional 
suppression requires the use of a suppressor codon, 
such as TAG, and the use of suppressor and 
nonsuppressor hosts. Expression of the bacteriophage 
genes required for replication, packaging, and the 
like is not affected by the use of suppressor and 
nonsuppressor hosts. 
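
The effect of conditional suppression at a TAG codon can be 
pictured with the following minimal sketch. It is a toy model 
under stated assumptions, not the construct of the Examples: 
the short coding sequence, the placement of the TAG codon 
between an upstream and a downstream segment, and the choice 
of glutamine ("Q") as the inserted residue are all 
hypothetical, since the residue actually inserted depends on 
the suppressor tRNA of the host. Suppression in a real host 
is also incomplete, so that both the terminated and the 
read-through products would be expected. 

```python
from typing import Optional

# Deliberately tiny codon table covering only the codons used below.
CODON_TABLE = {
    "ATG": "M", "GCT": "A", "AAA": "K", "GGT": "G",
    "TAA": "*", "TAG": "*",  # "*" marks a stop codon
}


def translate(dna: str, amber_suppressor: Optional[str] = None) -> str:
    """Translate dna in frame 0 until the first effective stop codon.

    If amber_suppressor is a one-letter residue, TAG is read as that
    residue (suppressor host); otherwise TAG terminates translation
    (nonsuppressor host).
    """
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon == "TAG" and amber_suppressor is not None:
            residues.append(amber_suppressor)
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# Hypothetical cistron: upstream segment, TAG, downstream segment, TAA stop.
TOY_CISTRON = "ATGGCTAAA" + "TAG" + "GGTGCT" + "TAA"

if __name__ == "__main__":
    print(translate(TOY_CISTRON))                        # nonsuppressor host
    print(translate(TOY_CISTRON, amber_suppressor="Q"))  # suppressor host
```

In this toy model the nonsuppressor host yields only the 
upstream product, whereas the suppressor host reads through 
the TAG codon and yields the full-length product. 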

E. Polyvalent Phage Having Different Display 
Proteins 

The invention also contemplates lambdoid 
bacteriophage particles having multiple species 
(polyvalent) of fusion proteins of this invention, 
thereby providing polyvalent phage particles. A phage 
particle having multiple displayed biological 
activities provides the advantages of polyvalence, 
crosslinking, labeling and the like well-characterized 
properties, the specifics of which depend upon the 
particular fusion proteins and associated biological 
activities being displayed. 

For example, a bivalent phage may contain a first 
fusion protein that provides a binding activity of 
interest useful for detection of a receptor, and the 
second fusion protein provides a detection means, such 
as a catalytic activity. Thus, the phage serves to 
conjugate two separate biological activities, namely a 
tag and a binding activity. Alternatively, one can 
use a bivalent (or multivalent) phage as a crosslinker 
for coupling two functions, such as ligands for two 
different receptors. Other permutations will be 
apparent to one skilled in the art. 

The presentation of two or more different fusion 
proteins on a phage of this invention can be 
accomplished by a variety of formats, and therefore 
need not be limiting. For example, one can introduce 
two separate biological activities onto two fusion 
proteins which have the same lambdoid matrix anchor, 
or have different anchors. 

In the case of using the same anchor, for example 
pV, a first pV fusion protein is expressed by a first 
cistron, and a second pV fusion protein is expressed 
by a second cistron. There is no particular 
limitation on the location of the first and second (or 
third, etc., if more) cistrons, which can be positioned 
on the same vector and under the control of the same 
or different promoters, or can be positioned on 
separate vectors, for example one on a plasmid and the 
other on the lambdoid vector. 

In the case of using different anchors, for 
example pV and pD, a pV fusion protein is expressed by 
a first cistron, and a pD fusion protein is expressed 
by a second cistron, with the cistron configurations 
varying as discussed above. 

In another permutation on this theme, one can 
utilize the differences between libraries based on pV- 
and pD-derived membrane anchors due to their inherent 
differences in valency. Because a library of phage 
having the pV-derived membrane anchor will typically 
contain only 1 to 4 copies of the enzyme- or 
ligand-binding complex on the surface of each phage 
particle, 
the phage presents a binding complex of relatively 
"low" valency, approaching one. In contrast, a 
library of phage having a pD-derived membrane anchor 
will typically contain 20 to 420 copies of the 
substrate- or ligand-binding complex on the surface of 
each phage particle, and the particle thus presents a 
relatively "high" valency. Thus, pV-based libraries 
are referred to as monovalent and pD-based libraries 
are referred to as multivalent. 

F. Polypeptides 

In another embodiment, the present invention 
contemplates a polypeptide (fusion protein) comprising 
an amino-terminal lambdoid matrix anchor polypeptide 
domain, a linker polypeptide domain, and a preselected 
polypeptide domain defining a biological function. 

The polypeptide is a fusion polypeptide having a 
preselected biological activity comprised of an amino 
acid residue sequence that defines the biologically 
active domain of a protein located at the
carboxy terminus of the fusion protein. That is, the 
preselected (inserted) polypeptide domain in the 
fusion polypeptide is the biologically active domain 
of a protein and is also referred to as a biologically 
active polypeptide.

In one embodiment, a polypeptide of this
invention has an amino acid residue sequence that can
be represented by the formula, shown in the direction
of amino- to carboxy terminus:

(F1) NH2-O-U-V-COOH,

where O represents an amino acid residue sequence
defining a lambdoid matrix anchor polypeptide, U
represents a linker polypeptide, and V represents an
amino acid residue sequence defining a biologically
active domain such as a substrate- or ligand-binding
receptor polypeptide.

In the formula (F1), the lambdoid matrix anchor
polypeptide and linker polypeptide are as defined
herein above. Thus, a preferred polypeptide comprises
a preselected polypeptide operatively linked at its
amino-terminus to the lambdoid matrix anchor
polypeptide and the linker polypeptide.

In preferred embodiments the lambdoid matrix
anchor polypeptide domain is derived from the lambdoid
phage pV or pJ tail proteins or from the lambdoid
phage pD head protein as described herein.

In preferred embodiments, the biologically active 
protein is a polypeptide chain of a catalytic 
multimeric protein. More preferably the multimeric
polypeptide is an enzymatic complex.

In other preferred embodiments, the biologically
active polypeptide is a polypeptide chain of a lectin
protein. More preferably the multimeric protein is a
mucin-binding complex.

Preferred multimeric polypeptides include
enzymes, lectins, and the like.

As used herein with regard to polypeptides, the
phrase "operatively linked" means that polypeptide
fragments, or protein domains represented by
polypeptides, have been covalently joined into a
single polypeptide polymer, preferably by conventional
amide bonds between the adjacent amino acids being
linked in the polypeptide.

In one embodiment, V is an amino acid residue 
sequence that defines the catalytic domain of a 
multimeric molecule, and preferably is an enzymatic 
polypeptide. Most preferred is a polypeptide where V 
is a β-gal monomeric polypeptide.

In another embodiment, V is an amino acid residue 
sequence that defines the ligand-binding domain of a 
multimeric receptor molecule, and preferably is a 
lectin polypeptide. Most preferred is a polypeptide 
where V is a BPA monomeric polypeptide. 

In another embodiment, U can define a linker 
polypeptide containing a proteolytic cleavage site, 
such as the sequence of amino acids found in a 
precursor protein, such as C. fimi exoglucanase or 
endoglucanase, collagen, prothrombin, factor X and the 
like, that defines the site of cleavage of the 
polypeptide. A fusion polypeptide having a cleavage 
site provides a means to purify the polypeptide away 
from the phage particle to which it is attached. 

The polypeptide linker U can have any sequence of 
amino acid residues of from about 1 to 50 amino acid 
residues in length. Typically the spacer residues are 
present in a polypeptide to accommodate the continuous 
reading frame that is required when a polypeptide is 
produced by the methods disclosed herein using a DNA 
expression vector of this invention. Further, the 
linker polypeptide U provides a means for the spatial 
separation of the upstream and downstream polypeptides 
and provides for the independent folding of the 
polypeptides.
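
As an illustration only, the arrangement of formula (F1) and the stated linker length can be sketched in Python. The class name, the field names, and the one-letter residue strings below are hypothetical placeholders invented for this sketch; they are not sequences disclosed herein.

```python
from dataclasses import dataclass

MAX_LINKER_LEN = 50  # upper end of the "about 1 to 50" residue range for U


@dataclass(frozen=True)
class FusionPolypeptide:
    anchor: str  # O: lambdoid matrix anchor domain (amino-terminal)
    linker: str  # U: linker polypeptide
    domain: str  # V: preselected biologically active domain (carboxy-terminal)

    def __post_init__(self):
        # Reject linkers outside the 1 to 50 residue range described for U.
        if not 1 <= len(self.linker) <= MAX_LINKER_LEN:
            raise ValueError("linker U should be about 1 to 50 residues long")

    def sequence(self) -> str:
        # Residues are written amino- to carboxy-terminus, as in formula (F1).
        return self.anchor + self.linker + self.domain


# Placeholder one-letter sequences, invented for illustration only.
fusion = FusionPolypeptide(anchor="MPVPNPT", linker="GSGS", domain="MTMITDS")
```

Calling fusion.sequence() joins the three domains in the O-U-V order; the length check is one possible reading of "about 1 to 50" and treats the bounds as strict.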

Preferably the receptor produced by the subject
invention is multimeric and is therefore normally
comprised of more than one identical polypeptide
chain, which together assume a conformation having a
binding affinity, or association constant, for the
preselected ligand that is different, preferably
higher, than the affinity or association constant of
any of the polypeptides alone or in pairs, i.e., as
monomers or dimers. The multimeric receptor is
referred to as a substrate- or ligand-binding
multimeric receptor to connote its ability to bind
substrate or ligand.

Thus, a preferred embodiment contemplates a 
ligand-binding multimeric receptor comprising first, 
second, and third polypeptides. The first polypeptide 
is a fusion polypeptide comprising an amino-terminal
lambdoid matrix anchor domain, a linker polypeptide,
and a preselected polypeptide. The second polypeptide
is a lambdoid matrix anchor domain. The third
polypeptide is a wild-type monomer polypeptide. A
particularly preferred ligand-binding multimeric
receptor contains a matrix anchor derived from pD, pV
or pJ as described herein.

A ligand-binding multimeric receptor is referred 
to as an epitope-binding complex to connote that the
complex has a capacity to bind an epitope present on a
ligand, and to connote that the multimeric receptor is
formed by the association (complexation) of more than
one polypeptide as described herein. 

The polypeptide chains are preferably derived 
from the coding region of an enzyme, lectin, receptor,
ligand or other protein having a desirable biological
activity. Typically, polypeptides comprising the
monomeric polypeptides are employed together for
binding a preselected substrate or ligand.

Thus, one embodiment contemplates a substrate-
binding multimeric receptor in which the polypeptides
are β-gal monomeric polypeptides. An alternative
embodiment contemplates a ligand-binding multimeric
receptor in which the polypeptides are BPA monomeric
polypeptides.

A receptor produced by the subject invention can
be active in monomeric as well as multimeric forms.
For example, β-gal substrate binding polypeptide
produced by the present invention can be 
advantageously combined in the multimer to modulate 
the activity of the monomeric polypeptides or to 
produce an activity unique to the multimer. 
Alternatively, the BPA ligand-binding polypeptide 
produced by the present invention can be 
advantageously combined in the multimer to modulate 
the activity of the monomeric polypeptides or to 
produce an activity unique to the multimer. 

G. Method for Producing Bacteriophage Particle
Having a Heterologous Cytoplasmic Protein on 
its Surface 

The present invention also describes methods 
for producing proteins on the surface of lambda phage 
particles and for producing recombinant lambdoid 
bacteriophage particles. The methods are based 
generally on the use of a lambdoid expression vector 
which can be produced from a variety of sources of 
lambdoid genomes, as described herein. 

The manipulation methods used to identify 
proteins displayed on the surface of phage particles 
have been extensively described in the literature, and 
will not be reviewed in detail herein, except for 
those features required to use unique embodiments of 
the present invention. However, the methods generally 
involve the use of a lambdoid phage (lambda) surface 
expression vector system for cloning and expressing
protein species.

In one exemplary embodiment, the method involves
preparing a lambda vector capable of displaying β-gal
on the surface of its phage particles by using E. coli
β-gal as a source of the β-gal gene. β-gal is a
multimeric protein which assembles four β-gal monomers
to form a tetrameric structure. The formation of the
tetrameric structure is necessary for the functional
activity of β-gal.

In another exemplary embodiment, the method
involves preparing a lambda vector capable of
displaying BPA on the surface of its phage particles
by using B. purpurea cDNA as a source of the BPA gene.
BPA is also a multimeric protein which assembles four
BPA monomers to form a tetrameric structure.

Alternatively, the displayed protein may be a 
monomeric protein. 

The method for producing a biologically active
multimeric polypeptide complex generally involves
(1) preparing a polypeptide-encoding gene and
inserting the gene into a lambdoid bacteriophage
vector of this invention using genomic DNA or cDNA as
a source for the DNA inserts, (2) expressing the
fusion protein containing the product of the
polypeptide-encoding gene in said vectors capable of
expressing and assembling a multimeric polypeptide
molecule on the surface of a lambdoid phage particle,
and (3) alternatively (i) isolating the
surface-expressed phage particle using immunoaffinity
techniques such as panning of phage particles against
a preselected biological binding activity, thereby
isolating one or more species of phage containing
particular polypeptide-encoding genes and protein
molecules that have the desired biological binding
activity, or (ii) isolating the surface-expressing
phage particle using enzymatic techniques such as
contacting the phage particle with a preselected
substrate, maintaining the phage-substrate mixture
under conditions favoring catalysis, and detecting the
formation of products, thereby identifying one or more
species of phage containing particular
polypeptide-encoding genes and protein molecules that
catalyze a preselected enzymatic reaction, or (iii)
isolating the surface-expressed phage particle using
techniques such as panning of phage particles
immunoreactive with a preselected antibody, thereby
isolating one or more species of phage containing
particular polypeptide-encoding genes and protein
molecules that bind with the preselected antibody.

Alternatively, the surface-expressed phage 
particle can be isolated using immunoaffinity
techniques such as immunoprecipitation of the phage 
particles with a preselected antibody, thereby 
isolating one or more species of phage containing 
particular polypeptide-encoding genes and protein 
molecules that immunoreact with the preselected 
antibody. 

The surface-expressed phage particle can also be 
isolated by a functional technique such as an 
enzymatic reaction with a preselected substrate, 
thereby isolating one or more species of phage 
containing particular polypeptide-encoding genes and 
protein molecules that enzymatically react with the 
preselected substrate. 

The surface-expressed phage particle can also be 
isolated by a functional technique such as a ligand- 
binding reaction with a preselected ligand, thereby 
isolating one or more species of phage containing 
particular polypeptide-encoding genes and protein 
molecules that bind to the preselected ligand.

As described herein, the resulting phage can be
manipulated to increase and/or alter the specificities
or affinities of the polypeptide(s) encoded in the
lambdoid vector to produce and subsequently identify
additional, desirable, activities of the present
invention.

For example, the β-gal-encoding genes can be
randomly mutagenized and subsequently screened for
desirable catalytic, affinity, and specificity
capabilities. Alternatively, the β-gal-encoding genes
can be mutagenized in a site-directed manner and
subsequently screened for desirable catalytic,
affinity, and specificity capabilities.

In addition, the BPA-encoding genes can be
randomly mutagenized and subsequently screened for
desirable affinity and specificity capabilities.
Alternatively, the BPA-encoding genes can be
mutagenized in a site-directed manner and subsequently
screened for desirable affinity and specificity
capabilities.

In one embodiment, the pV-polypeptide and 
polypeptide monomer genes can be inserted into 
separate, monocistronic expression vectors, referred
to as a "binary" system. In this method, step (2)
above differs in that the combining of pV-polypeptide
and polypeptide monomer encoding genes occurs by the
co-introduction of the two binary plasmids into a
single host cell for expression and assembly of a
phage having the surface accessible polypeptide
multimeric molecule.

As a further characterization of the present
invention the immunoreactivity of the β-gal multimeric
polypeptide on the surface of lambda phage is
determined. The immunoreactivity provides essential
information regarding the conformation of the
multimeric polypeptides.

Inhibition assays in which various sugars 
competitively inhibit binding of BPA to its ligand, 
mucin, further characterize the present invention. 

Exemplary preparations of functionally active 
multimeric polypeptides displayed on the surface are 
described in the Examples. The isolation of a 
particular vector capable of expressing a polypeptide
of interest involves the introduction of the 
expression vector into a host cell permissive for 
expression of lambdoid phage genes and the assembly of 
phage particles. Where the binary vector system is 
used, both vectors are introduced in the host cell. 
Typically, the host is E. coli. The resulting host
cell is cultured to allow the introduced phage and 
polypeptide genes to be expressed, and for phage 
particles to be assembled and released from the host 
cell. The released phage particles are then harvested 
(collected) from the host cell culture media and 
screened for desirable immunoreaction, enzymatic, and 
ligand-binding properties. Typically, the harvested 
particles are "panned" for binding to a preselected 
ligand, enzymatic activity, or for immunoreaction with 
a preselected antibody. The strongly binding, 
enzymatic, or immunoreactive particles are then 
collected, and individual species of particles are 
clonally isolated and further screened for enzymatic
activity, binding affinity, or immunoreactivity.
Phage which produce biologically active polypeptides 
are selected and used as a source of enzymatic, 
ligand-binding, or immunoreactive polypeptides of this 
invention. 

Functionally active polypeptides with altered 
enzymatic, binding, or immunoreactive properties can 
also be produced by altering the nucleotide sequence
of a polynucleotide sequence that encodes a
biologically active polypeptide of this invention. 
For example, by site directed mutagenesis, one can 
alter the nucleotide sequence of an expressed 
polypeptide and thereby introduce changes in the 
resulting expressed amino acid residue sequence. 
Alternatively, one can take a known polynucleotide and
alter it by random mutagenesis, reintroduce the
altered polynucleotide into an expression system, and 
subsequently screen the product polypeptide for a 
desired property or activity. 

Site-directed and random mutagenesis methods are 
well known in the polynucleotide arts, and are not to 
be construed as limiting as methods for altering the 
nucleotide sequence of a subject polynucleotide. 

Due to the presence of the phage particle in an
immunoaffinity- or activity-isolated polypeptide, one
embodiment involves the manipulation of the resulting
cloned genes to truncate the polypeptide-encoding gene
such that a monomeric polypeptide is produced by the
host E. coli cell containing the phage vector. Thus,
the resulting manipulated cloned polypeptide-encoding
genes produce a monomeric polypeptide which assembles
to form a multimeric complex and can be readily
characterized in ELISA assays for ligand-binding
studies, in competition assays with known antagonists, 
and in enzymatic assays. The soluble monomer provides 
a reproducible and comparable preparation for 
comparative and characterization studies. 

Alternative embodiments for practicing the 
invention are summarized hereafter in several formats 
to emphasize different aspects. 

The method of producing a lambdoid bacteriophage 
comprises the steps of:

1) infecting a suitable suppressor host with an
engineered lambdoid phage vector of this invention
that encodes a fusion protein; and

2) culturing the host under phage growth
conditions to express and assemble phage.

The method of producing a lambdoid bacteriophage 
can alternatively comprise the steps of: 

1) introducing a preselected gene encoding a
biologically active domain of a cytoplasmic protein
into a cistron on a vector of this invention adapted
for forming a fusion protein and monomeric proteins;
and

2) expressing the vector in a suitable host to
form assembled phage particles.

Preferably, the suppressor host has a termination
codon suppressor phenotype selected from the group
consisting of the amber and opal phenotypes. In
addition, a preferred E. coli host strain is selected
from the group consisting of strains EQ166, CA168 and
MC8.

An alternative embodiment uses a separate cistron 
to express the wild-type phage surface protein, either 
on the same genome, on a second helper phage genome, 
or on any other recombinant molecule expressed in the
host. Additionally, the separate cistron can be used
to supplement the expression of the suppressor cistron
to vary the ratio of fusion protein to monomer
protein.

Another alternative embodiment uses a separate
cistron to express the fusion protein, either on the
same genome, on a second helper phage genome, or on
any other recombinant molecule expressed in the host.

An additional alternative embodiment uses a
separate cistron to express the monomeric protein,
either on the same genome, on a second helper phage
genome, or on any other recombinant molecule expressed
in the host.



H. Expression Vectors and Polynucleotides for
Expressing Proteins on the Surface of Lambda
Phage Particles

The preparation of bacteriophage lambda 
phage particles which display a functionally active 
protein on their surface depends, in one embodiment, 
on the cloning and expression vectors used to prepare 
the phage particles described herein. 

A recombinant bacteriophage lambda vector of this 
invention produces fusion proteins that assemble with 
monomeric proteins to form a functionally active 
multimeric protein displayed on the surface of an 
assembled lambdoid phage particle. 

A vector of the present invention is a 
recombinant DNA (rDNA) molecule adapted for receiving 
and expressing translatable DNA sequences in the form 
of a fusion polypeptide and a monomeric polypeptide. 
The fusion polypeptide contains a lambdoid
phage matrix anchor domain, a linker polypeptide, and
a monomeric polypeptide. The vector comprises a
cassette that includes upstream and downstream 
translatable DNA sequences operatively linked via a 
sequence of nucleotides adapted for ligation to an 
insert DNA. The upstream translatable sequence 
encodes the lambdoid phage matrix anchor domain and 
linker peptide as defined herein. The cassette 
preferably includes DNA expression control sequences 
for expressing the fusion polypeptide that is produced 
when an insert translatable DNA sequence (insert DNA) 
is inserted into the cassette via the sequence of 
nucleotides adapted for ligation. The lambdoid phage 
matrix anchor is preferably a portion of the pV tail
protein capable of binding the matrix of a lambdoid
phage particle, thereby incorporating the fusion 
polypeptide onto the phage surface. 

The cassette preferably includes DNA expression
control sequences for expressing the monomeric
polypeptide that is produced when an insert
translatable DNA sequence (insert DNA) is
directionally inserted into the cassette via the
sequence of nucleotides adapted for ligation. The
monomeric polypeptide is preferably capable of
associating with the fusion protein and other
monomeric polypeptides, as required, thereby
incorporating the monomeric polypeptide onto the phage
surface. The fusion protein and monomeric proteins
thus form a biologically active multimeric complex on
the surface of the phage particle.

In addition, the cassette preferably includes DNA 
expression control sequences for expressing the fusion 
protein. The DNA expression control sequences for
expressing the lambdoid phage matrix anchor are
upstream and downstream of the nucleotide sequence
encoding said lambdoid phage matrix anchor. The
upstream DNA expression control sequences can be those
found in any lambdoid phage and are regulated by
proteins expressed by the lambdoid genome. The
downstream DNA expression control sequence is a
suppressor codon at the fusion point for conditional
regulation of a read-through (fusion) product. The
suppressor codon provides a means for the conditional
expression of the lambdoid phage matrix anchor and the
fusion product which comprises the lambdoid phage
matrix anchor polypeptide, linker polypeptide, and
downstream insert translatable DNA sequences. The
lambdoid phage matrix anchor polypeptide is preferably
capable of assembling with other lambdoid phage
polypeptides to form a functional lambdoid phage tail,
thereby providing the means for infection of a 
suitable bacterial host. 

An expression vector is characterized as being 
capable of expressing, in a compatible host, 
structural gene products such as a fusion polypeptide 
and a lambdoid phage matrix anchor gene product of the 
present invention. In one embodiment, an expression 
vector is also characterized as being capable of 
expressing, in a compatible host, a heterologous 
monomeric gene product such as a monomeric polypeptide 
of the present invention for association with the 
fusion protein to form a multimeric protein complex in 
the phage matrix. 

As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting between 
different genetic environments another nucleic acid to 
which it has been operatively linked. Preferred 
vectors are those capable of autonomous replication 
and expression of structural gene products present in 
the DNA segments to which they are operatively linked. 

As used herein with regard to DNA sequences or 
segments, the phrase "operatively linked" means the 
sequences or segments have been covalently joined, 
preferably by conventional phosphodiester bonds, into 
one strand of DNA, whether in single or double 
stranded form. 

The choice of vector to which a cassette of this 
invention is operatively linked depends directly, as 
is well known in the art, on the functional properties 
desired, e.g., vector replication and protein 
expression, and the host cell to be transformed, these 
being limitations inherent in the art of constructing 
recombinant DNA molecules. 

In preferred embodiments, the vector utilized
includes a prokaryotic replicon, i.e., a DNA sequence
having the ability to direct autonomous replication
and maintenance of the recombinant DNA molecule
extrachromosomally in a prokaryotic host cell, such as
a bacterial host cell, transformed therewith. Such
replicons are well known in the art. Vectors
typically also contain convenient restriction sites
for insertion of translatable DNA sequences.
Exemplary vectors are the lambda vectors EMBL3, EMBL4,
FIX, FIX® II, DASH, DASH® II, ZAP, ZAP® II, and ZAP
Express™ available from Stratagene (La Jolla, CA) and
ExCell available from Pharmacia (Piscataway, NJ). In
addition, embodiments may also include the α-peptide
of the β-gal gene to facilitate the identification of
recombinant vectors containing insert DNAs.

A sequence of nucleotides adapted for ligation, 
i.e., a polylinker or multiple cloning site (mcs), is
a region of the DNA expression vector that (1) 
operatively links for replication the upstream and 
downstream translatable DNA sequences and (2) provides 
a site or means for insertion of a DNA sequence into 
the vector. Typically, a polylinker is a sequence of 
nucleotides that defines two or more restriction 
endonuclease recognition sequences, or restriction 
sites. Upon restriction cleavage, the site yields 
cohesive termini to which a translatable DNA sequence 
can be ligated to the DNA expression vector. In 
addition, a polylinker can be cleaved with two 
restriction endonucleases to form two cohesive termini 
to which a translatable DNA sequence can be ligated to 
the DNA expression vector. Preferably, the two 
restriction sites provide, upon restriction cleavage, 
cohesive termini that are non-complementary and 
thereby permit directional insertion of a translatable 
DNA sequence into the cassette. In one embodiment,
the directional ligation means is provided by
nucleotides present in the upstream translatable DNA 
sequence, in a sequence that encodes a linker 
polypeptide, in a downstream translatable DNA 
sequence, or in two or more such sequences. In 
another embodiment, the sequence of nucleotides 
adapted for directional ligation comprises a sequence 
of nucleotides that defines multiple directional 
cloning means. Where the sequence of nucleotides 
adapted for directional ligation defines numerous 
restriction sites, it is referred to as a multiple 
cloning site. 
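
The requirement that the two cohesive termini be non-complementary can be illustrated with a short Python sketch. The function names are hypothetical, and the sketch assumes that both overhangs are 5' overhangs written 5' to 3'. The overhangs used in the example are those left by the well-known enzymes EcoRI (AATT), XhoI (TCGA) and SalI (TCGA); these enzymes are chosen only for illustration and are not recited in this specification.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def can_anneal(overhang_a, overhang_b):
    """Two 5' overhangs (both written 5'->3') can base-pair with each
    other when one is the reverse complement of the other."""
    return overhang_a.upper() == reverse_complement(overhang_b)


def is_directional(left_overhang, right_overhang):
    """A doubly cut cassette permits directional insertion when its two
    cohesive termini cannot anneal to each other."""
    return not can_anneal(left_overhang, right_overhang)


# EcoRI + XhoI ends differ, so an insert can be ligated in one orientation.
ecori_xhoi = is_directional("AATT", "TCGA")
# XhoI + SalI ends are identical, so orientation is not enforced.
xhoi_sali = is_directional("TCGA", "TCGA")
```

Under these assumptions ecori_xhoi evaluates to True and xhoi_sali to False, which mirrors the preference stated above for two restriction sites that yield non-complementary termini.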

A translatable DNA sequence is a linear series of 
nucleotides that provide an uninterrupted series of at 
least 8 codons that encode a polypeptide in one 
reading frame.
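
The definition above can be expressed as a simple check, shown here as a hedged Python sketch. The function name is hypothetical, and the sketch reads "uninterrupted" as meaning that no stop codon occurs in the frame that begins at the first nucleotide; other readings are possible.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
MIN_CODONS = 8  # minimum number of codons stated in the definition


def is_translatable(dna, min_codons=MIN_CODONS):
    """Return True if dna, read from its first base, yields at least
    min_codons complete codons and none of them is a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if len(codons) < min_codons:
        return False
    return not any(codon in STOP_CODONS for codon in codons)


# Invented test sequences: eight sense codons, and the same with a stop.
open_frame = "ATG" + "GCT" * 7
interrupted = "ATGGCTTAG" + "GCT" * 6
```

Here is_translatable(open_frame) returns True, while is_translatable(interrupted) returns False because the third codon is TAG.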

An upstream translatable DNA sequence encodes a 
lambdoid phage matrix anchor (tail polypeptide).
Preferred matrix anchors are obtainable from lambdoid
phage λ, φ80, φ81, and the like equivalent lambdoid
phage. Preferred matrix anchor domains are found in
the coat or tail proteins encoded by gene D and genes 
V and J, respectively. Thus, an upstream translatable 
DNA sequence encodes an amino acid residue sequence 
that corresponds, and preferably is identical, to the 
matrix anchor domain of either a lambdoid phage coat 
or tail protein. The matrix anchor domain of a 
lambdoid phage tail protein, pV, is preferably a 
portion of the amino terminal region of the tail 
protein and includes a region of amino acid residue 
sequences 1 to 176. pV tail protein is present on a 
mature lambdoid phage along the tail with typically 
about 192 copies of the tail protein.

A cassette in a DNA expression vector of this 
invention is the region of the vector that forms, upon
insertion of a translatable DNA sequence (insert DNA),
a sequence of nucleotides capable of expressing, in an
appropriate host, a fusion polypeptide of this
invention.

The expression-competent sequence of nucleotides
is referred to as a cistron. Thus, the cassette
comprises DNA expression control elements operatively
linked to the upstream and downstream translatable DNA
sequences. A cistron is formed when a translatable
DNA sequence is directionally inserted (directionally
ligated) between the upstream and downstream sequences
via the sequence of nucleotides adapted for that
purpose. The resulting translatable DNA sequences,
namely the upstream and the inserted sequences, are
all operatively linked in the same reading frame.

DNA expression control sequences comprise a set 
of DNA expression signals for expressing a gene 
product and include both 5' and 3' elements, as is 
well known, operatively linked to the cistron such
that the cistron is able to express a gene product.
The 5' control sequences define a promoter for
initiating transcription and a ribosome binding site
operatively linked at the 5' terminus of the upstream
translatable DNA sequence.

To achieve high levels of gene expression in E.
coli, it is necessary to use not only strong promoters
to generate large quantities of mRNA, but also
ribosome binding sites to ensure that the mRNA is
efficiently translated. In E. coli, the ribosome
binding site includes an initiation codon (AUG) and a
sequence 3-9 nucleotides long located 3-11 nucleotides
upstream from the initiation codon (Shine et al.,
Nature, 254:34, 1975). Alternatively, the initiation
codon in E. coli may be GUG. The sequence, AGGAGGU,
which is called the Shine-Dalgarno (SD) sequence, is
complementary to the 3' end of E. coli 16S rRNA.
Binding of the ribosome to mRNA and the sequence at
the 3' end of the mRNA can be affected by several
factors:

(i) The degree of complementarity between the SD
sequence and 3' end of the 16S rRNA.

(ii) The spacing and possibly the DNA sequence
lying between the SD sequence and the initiator codon
(Roberts et al., Proc. Natl. Acad. Sci. USA, 76:760,
1979a; Roberts et al., Proc. Natl. Acad. Sci. USA,
76:5596, 1979b; Guarente et al., Science, 209:1428,
1980; and Guarente et al., Cell, 20:543, 1980).
Optimization is achieved by measuring the level of
expression of genes in plasmids in which this spacing
is systematically altered. Comparison of different
mRNAs shows that there are statistically preferred
sequences from positions -20 to +13 (where the A of
the AUG is position 0) (Gold et al., Annu. Rev.
Microbiol., 35:365, 1981). Leader sequences have been
shown to influence translation dramatically.

(iii) The nucleotide sequence following the
initiator codon, which affects ribosome binding
(Taniguchi et al., J. Mol. Biol., 118:533, 1978).
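
Factors (i) and (ii) above can be examined for a given leader sequence with a small Python sketch. This is an illustrative helper only: the function name, its parameters, and the example leader are hypothetical, and the sketch assumes that "located 3-11 nucleotides upstream" refers to the number of nucleotides between the end of the SD-like stretch and the initiation codon. It scores only exact matches to part of AGGAGGU and so is a crude stand-in for complementarity to 16S rRNA.

```python
SD_CONSENSUS = "AGGAGGU"  # Shine-Dalgarno sequence quoted in the text


def find_sd(mrna, start_index, min_len=3, min_gap=3, max_gap=11):
    """Return (match, gap) for the longest SD-like stretch upstream of the
    initiation codon at start_index, or None if there is none.

    match is a substring of SD_CONSENSUS at least min_len long; gap is the
    number of nucleotides between the end of match and the initiation codon.
    """
    mrna = mrna.upper().replace("T", "U")
    best = None
    for gap in range(min_gap, max_gap + 1):
        end = start_index - gap  # index just past the candidate match
        for length in range(len(SD_CONSENSUS), min_len - 1, -1):
            begin = end - length
            if begin < 0:
                continue
            candidate = mrna[begin:end]
            if candidate in SD_CONSENSUS:
                if best is None or length > len(best[0]):
                    best = (candidate, gap)
                break  # shorter matches ending here cannot beat this one
    return best


# Invented leader sequence; its AUG initiation codon starts at index 14.
example = "UUAGGAGGUAACAUAUGGCU"
result = find_sd(example, example.index("AUG"))
```

For the invented leader, result is ("AGGAGGU", 5): the full consensus is present with five nucleotides separating it from the AUG. Systematically varying that spacing, as described in factor (ii), would correspond to changing the gap value reported here.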

The 3' control sequences define at least one 
termination (stop) codon in frame with and operatively 
linked to the inserted translatable DNA sequence. The 
preferred stop codon is a suppressor codon that is 
regulated by conditional suppression. Conditional 
suppression refers to the use of nonsuppressor and 
suppressor hosts to express the encoded polypeptide 
under different conditions. In a nonsuppressor host, 
a tRNA does not insert an amino acid residue at the 
position of the amber codon resulting in termination 
of translation of the upstream lambda phage matrix 
anchor polypeptide. In a suppressor host, a mutant
tRNA inserts an amino acid residue at the position of
the amber codon resulting in a read-through at the
position of the amber codon and translation of the
downstream nucleotide sequences. Read-through thus
results in the expression of a fusion protein
comprising the lambda matrix anchor protein, a linker
polypeptide, and the polypeptide encoded by the
inserted translatable DNA sequence. In addition, in a
suppressor host, initiation of translation of the
inserted translatable DNA sequence occurs from 5'
expression control elements contained within the
linker peptide and results in translation of the
inserted translatable DNA sequence. Further discussion
regarding conditional suppression is described herein.
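
Conditional suppression at an amber codon can be modeled with the following Python sketch, offered for illustration only. The function name, the toy cassette, and the choice of glutamine (Q) as the residue inserted by the suppressor tRNA are assumptions of this sketch and are not specified above. The sketch models complete read-through, whereas read-through in a suppressor host is partial so that both products are formed, and it does not model the separate initiation of translation within the linker region.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order; "*" marks a stop codon.
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
AMBER = "TAG"


def translate(dna, suppressor_host, inserted_residue="Q"):
    """Translate dna from its first base. In a suppressor host the amber
    codon is read through by inserting inserted_residue; otherwise any
    stop codon, including amber, terminates translation."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3].upper()
        if codon == AMBER and suppressor_host:
            protein.append(inserted_residue)  # read-through at amber
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # termination
        protein.append(residue)
    return "".join(protein)


# Toy cassette: anchor (MP), amber codon, linker (GS), insert (MT), stop.
cassette = "ATGCCT" + "TAG" + "GGTTCT" + "ATGACC" + "TAA"
anchor_only = translate(cassette, suppressor_host=False)
fusion = translate(cassette, suppressor_host=True)
```

With the toy cassette, anchor_only is "MP" (termination at the amber codon) and fusion is "MPQGSMT" (anchor, inserted residue, linker and insert), corresponding to the truncated anchor polypeptide and the read-through fusion product, respectively.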

Thus, a DNA expression vector of this invention
provides a system for inserting translatable DNA 
sequences into the cassette portion of the vector to 
produce a cistron capable of expressing a fusion 
polypeptide of this invention. 

In one embodiment, a DNA expression vector
provides a system for inserting (cloning) a 
translatable DNA sequence into a single cassette 
present in the vector, to form two cistrons for 
expressing two polypeptides. The two polypeptides
expressed are (1) a fusion polypeptide and (2) a
truncated phage tail polypeptide, each capable of 
associating in the phage matrix to form a functional 
phage particle. The fusion polypeptide is formed in a 
suppressor host which allows conditional (partial)
read-through of the suppressor termination codon, and
the truncated tail polypeptide is formed where read- 
through does not occur and termination occurs at the 
suppressor termination codon. 

In another embodiment, a DNA expression vector
provides a system for inserting (cloning) a
translatable DNA sequence into a single cassette
present in the vector, to form three cistrons for
expressing three polypeptides. Two of said
polypeptides assemble to form a multimeric complex.
The two polypeptides expressed are a fusion
polypeptide and a monomeric polypeptide capable of 
associating to form a multimeric protein. The 
monomeric polypeptide is translated by initiation of 
translation using a second ribosome-binding-site
located in the sequence defining the linker
polypeptide. The third polypeptide in this embodiment 
is the truncated phage tail polypeptide produced by 
termination at the suppressor termination codon as 
discussed above. 

Alternatively, a DNA expression vector of this
invention can comprise, in addition to the cassette 
previously described in detail, a second cassette for 
expressing a monomeric polypeptide. The second 
cassette includes a sequence of nucleotides adapted
for directional ligation to a downstream DNA sequence
of the vector that typically defines at least one stop 
codon in the reading frame of the cassette. The 
second cassette includes a translatable DNA sequence 
encoding the monomeric polypeptide and is operatively
linked at its 5' terminus to DNA expression control
sequences forming the 5' elements defined above. A 
ribosome binding site of the second cassette is also 
contained within the 5' control elements of the 
polypeptide linker described herein. The second
cassette is capable, upon insertion of a translatable
DNA sequence (insert DNA), of expressing the monomeric
polypeptide.

In another alternative embodiment, the second 
cassette for expressing monomeric polypeptide can be
contained on a separate vector independent of the
vector containing the cistron that encodes the fusion
protein.

In a preferred embodiment, a DNA expression 
vector is designed for convenient manipulation in the
form of a lambdoid phage particle encapsulating a
genome according to the teachings of the present 
invention. In this embodiment, a DNA expression 
vector further contains a nucleotide sequence that 
defines a lambdoid phage origin of replication such
that the vector can replicate as a lambdoid phage in
double stranded replicative form and be packaged into 
lambdoid phage particles. This feature provides the 
ability of the DNA expression vector to be packaged 
into phage particles for subsequent segregation of the
particle, and vector contained therein, away from
other particles that comprise a population of phage
particles.

A lambdoid phage origin of replication is a 
region of the phage genome, as is well known, that
defines sites for initiation of replication and
termination of replication.

A preferred lambdoid phage origin of replication
for use in the present invention is a λ, φ80, or φ81
phage origin of replication. Particularly preferred
is a lambdoid phage origin of replication described
in the Examples herein. Preferred DNA expression
vectors are the expression vectors λV'sac, λV'mcs,
λblue, λblue-α, and λfoo described in Example 1.

Insofar as a vector of this invention may be
manipulated to contain an insert DNA, thereby having
the capacity to express a fusion polypeptide and a 
monomeric polypeptide, one embodiment contemplates the 
previously described vectors containing an insert DNA. 
Particularly preferred vectors containing multimeric
protein genes are described in the Examples.

In preferred embodiments, the vector utilized is
capable of replication in a suitable prokaryotic host
cell and includes a prokaryotic origin of replication
or replicon, i.e., a DNA sequence having the ability
to direct autonomous replication and maintenance of
the recombinant DNA molecule extrachromosomally in a
prokaryotic host cell, such as a bacterial host cell, 
transformed therewith. Such origins of replication 
are well known in the art. Preferred origins of
replication are those that are efficient in the host
organism. A preferred host cell is E. coli. For use
of a vector in E. coli, a preferred origin of
replication is the λ ori.

Alternatively, the vector utilized includes
sequences encoding gene products that promote
insertion of the recombinant DNA molecule into the DNA 
of the prokaryotic host cell, such as a bacterial host 
cell, transformed therewith. The vector utilized also 
includes sequences encoding gene products that promote
the maintenance of insertion of the recombinant DNA
molecule in the DNA of the prokaryotic host cell, such 
as a bacterial host cell, transformed therewith. The 
recombinant DNA molecule so inserted replicates as 
part of the prokaryotic host cell chromosome. This
process of insertion, maintenance, and replication is
known as lysogenization. Such gene products that
promote lysogenization are well known in the art.
Preferred gene products which promote lysogenization
are those that are efficient in the host organism. A
preferred host cell is E. coli. Preferred gene
products are those expressed by λ.

A vector for expression of a functionally active 
protein of the invention on the surface of a lambdoid 
phage particle is a recombinant DNA (rDNA) molecule
adapted for receiving and expressing translatable DNA
sequences.

In one embodiment, the expressed translatable DNA 
sequence is a monomer of a multimeric protein. The 
rDNA molecule of this invention is capable of 
expressing the translatable DNA sequence as a fusion 
polypeptide fused to a portion of a lambdoid matrix 
anchor protein. That is, one of the polypeptides 
expressed is a fusion polypeptide comprising a 
lambdoid phage matrix anchor polypeptide, linker 
polypeptide, and a preselected monomer polypeptide. 

A DNA expression vector for expressing a 
functionally active molecule provides a system for 
cloning (inserting) a translatable DNA sequence into a 
cassette present in the vector, to form three separate 
cistrons for expressing the first, second, and third 
polypeptides of the functionally active molecule, or 
the phage tail polypeptide, fusion polypeptide, and 
monomeric polypeptide that comprise the phage tail 
molecule and multimeric molecule. The DNA expression 
vector for expressing three cistrons is referred to as 
a tricistronic expression vector. 

A DNA expression vector of the present invention 
provides translatable DNA sequences comprising a 
linker polypeptide. The linker polypeptide can 
provide several features which include: 1) a 
suppressor codon, 2) translatable DNA sequences 
comprising a flexible amino acid sequence and a 
proteolytic cleavage site, 3) a DNA expression control 
element for expression of the downstream translatable 
insert DNA, and 4) a sequence of nucleotides adapted 
for ligation to an insert DNA. 

The suppressor codon allows for the expression of 
the fusion polypeptide described herein in a suitable 
suppressor host. In a nonsuppressor host, the 
suppressor codon allows for the translational 
termination of the upstream DNA translatable sequence.

The linker polypeptide can also provide a means 
for control of DNA expression of the downstream 
polypeptide. The 5' control sequences define a 
ribosome binding site which is operatively linked at 
the 5' terminus of the downstream polypeptide. The
ribosome binding site is linked to the cistron such
that the cistron is able to express a structural gene
product.

The translatable DNA sequences which encode the 
linker polypeptide also provide a means for ligation 
of an insert DNA into the expression vector. 

A preferred recombinant bacteriophage lambda 
vector is a recombinant DNA (rDNA) molecule comprising 
a nucleotide sequence that (i) defines the lambdoid 
elements for replication and packaging of the vector 
into an assembled bacteriophage as described herein, 
and (ii) encodes a conditionally suppressible cistron 
for expression of both a tail protein and a fusion 
protein. The conditionally suppressible cistron 
comprises:

a) a promoter for transcribing the cistron 
operatively linked 5' to the reading frame of the 
cistron, 

b) a first upstream translatable sequence that 
encodes a lambdoid bacteriophage tail polypeptide, 

c) a first ribosome binding site to initiate 
translation of the upstream translatable sequence, 

d) a second translatable sequence operatively 
linked downstream to the first translatable sequence 
that (i) encodes a linker polypeptide in frame with
the tail polypeptide and (ii) includes a sequence
adapted for ligation of an insert polynucleotide that
defines a third translatable sequence downstream from
the second translatable sequence that encodes a
preselected polypeptide, and

e) a suppressor termination codon within the
second translatable sequence that upon suppression
results in read-through to form a fusion polypeptide
consisting of the tail polypeptide, linker polypeptide
and preselected polypeptide.

In this embodiment, upon insertion of an insert
polynucleotide that encodes a preselected polypeptide
to be displayed on the fusion protein into the
sequence adapted for ligation, the vector can produce
two proteins when grown in a conditional suppressor
host strain that partially suppresses termination,
resulting in a ratio of two translation products. The
two products are (1) a truncated tail protein produced
by termination at the non-suppressed suppressor codon,
and (2) a fusion protein comprising, in the direction
of amino- to carboxy-terminus, a phage tail
polypeptide domain, a linker polypeptide domain and a
preselected polypeptide domain produced by suppression
and read-through of the suppressor codon and
translation of the third translatable sequence. Thus, 
the degree of suppression controls the ratio of the 
two products produced by the cistron in the vector. 
This embodiment has the advantage of providing
both the fusion protein and the bacteriophage tail
protein. By proper selection of the suppressor host,
the ratio can be adjusted such that read-through
is a rare event, yielding 1 to 10 copies of
fusion protein to 200 copies of the truncated tail
protein. A single vector can thereby produce all the
tail components required for assembly of infectious
lambdoid bacteriophage including the proper amount of 
fusion protein. 
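
As a worked illustration of the ratio stated above, the
read-through frequency implied by 1 to 10 copies of fusion
protein per 200 copies of truncated tail protein can be
computed directly. The function name below is a hypothetical
helper introduced for this sketch, and the calculation assumes
that every translation event yields exactly one of the two
products.

```python
def readthrough_fraction(fusion_copies, truncated_copies):
    """Fraction of translation events that read through the suppressor codon."""
    return fusion_copies / (fusion_copies + truncated_copies)


# Bounds taken from the ratio stated in the text (1 to 10 per 200).
low = readthrough_fraction(1, 200)    # 1 / 201
high = readthrough_fraction(10, 200)  # 10 / 210
```

Under that assumption the stated ratio corresponds to a
read-through frequency of roughly 0.5% to 5%.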

In a related embodiment, the vector can further
contain a nucleotide sequence that defines a second
ribosome binding site in the above described second
translatable sequence (linker polypeptide domain) that 
initiates translation of the third translatable 
sequence, i.e., the preselected polypeptide as a free 
non-fused polypeptide. Thus, the same vector that 
produced the above two polypeptides, namely the 
truncated tail protein and the fusion protein, can 
also produce a third protein by initiation of 
translation at the second ribosome binding site to 
produce the preselected polypeptide as a free protein 
rather than as a portion of the fusion protein.

This embodiment provides the advantage of 
producing the monomer subunits of the preselected 
polypeptide of a multimer in addition to the fusion 
protein subunit of the multimer. 

A preferred nucleotide sequence in the first 
upstream translatable sequence that encodes a tail 
polypeptide encodes any of the lambdoid tail proteins, 
preferably selected from the group consisting of pJ,
pV, pG, pM and pT. A particularly preferred tail
protein is pV, and more preferably, the fusion protein 
includes residues 1-176 of the sequence of pV shown in 
SEQ ID NO 1, or conservative substitutions thereof. 

A preferred suppressor termination codon is 
either the amber or opal codons, and depends upon the 
suppressor strain to be utilized in conjunction with 
the vector, as is described herein. 

A preferred second translatable sequence that 
defines a linker can be any sequence that encodes a 
linker as described herein, and preferably includes 
the sequence adapted for ligation (a multiple cloning 
site or sites), preferably directional ligation. A
preferred linker polypeptide is the Pro-Thr box of C.
fimi endoglucanase and exoglucanase as described
herein. An additional preferred linker is described
in the Examples and has a nucleotide sequence from
base number 547 to 654 of SEQ ID NO 5.

A cassette in a DNA expression vector of this 
invention is the region of the vector that forms, upon 
insertion of a translatable DNA sequence (insert DNA),
a sequence of nucleotides capable of expressing, in an
appropriate host, a polypeptide. The expression-
competent sequence of nucleotides is referred to as a
cistron. Thus, the cassette comprises DNA expression
control elements operatively linked to the upstream
translatable DNA sequences. A cistron is formed when 
a translatable DNA sequence is inserted (ligated) into 
the expression vector via the sequence of nucleotides 
adapted for that purpose. The resulting three
translatable DNA sequences, namely the first upstream,
the second upstream (linker) and the inserted 
sequences, are all operatively linked. 

Insofar as promoters in prokaryotes may control
transcription of large polycistronic messenger RNA
molecules, the promoter in the present vector can be
positioned at any of a variety of locations in the 
lambdoid genome and need not be immediately adjacent 
to the 5' ribosome binding site that initiates 
translation of the cistron that encodes the fusion
protein of this invention.

Thus, a DNA expression vector for expressing a 
phage tail polypeptide, a fusion polypeptide, and 
optionally a monomeric polypeptide provides a system 
for cloning preselected translatable DNA sequences
into the cassette portions of the vector to produce
cistrons capable of expressing the first and second, 
and optionally third polypeptides, e.g., the pV, the 
pV-monomer (fusion polypeptide), and monomer
polypeptides. The fusion and monomer polypeptides
assemble to form a multimeric protein.

Insofar as polynucleotides are component parts of
a DNA expression vector for producing a fusion 
polypeptide residue sequence, the invention also 
contemplates isolated polynucleotides that encode such 
fusion sequences. 

It is to be understood that, due to the genetic 
code and its attendant redundancies, numerous 
polynucleotide sequences can be designed that encode a 
contemplated pV residue sequence. Thus, the invention 
contemplates such alternate polynucleotide sequences 
incorporating the features of the redundancy of the 
genetic code. 
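
The scale of this redundancy can be illustrated by counting the
polynucleotide sequences that encode a given residue sequence.
The sketch below uses the codon degeneracy of the standard
genetic code; the function name and the example peptides are
hypothetical and are not taken from the sequence listing.

```python
import math

# Number of sense codons per amino acid in the standard genetic code
# (one-letter residue codes; the values sum to 61 sense codons).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2,
    "G": 4, "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2,
    "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encodings(peptide):
    """Count the distinct coding sequences for a one-letter peptide string."""
    return math.prod(CODONS_PER_RESIDUE[residue] for residue in peptide)


# Hypothetical example: a short Pro-Thr repeat.
pro_thr_repeat = count_encodings("PTPT")
```

Because the count is a product over residues, it grows
exponentially with the length of the encoded sequence.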

Insofar as the expression vector for producing a 
fusion polypeptide of this invention is carried in a 
host cell compatible with expression of the fusion 
polypeptide, the invention contemplates a host cell 
containing a vector or polynucleotide of this 
invention. A preferred host cell is E. coli, as
described herein. 

An E. coli host, MC8, capable of replicating and 
expressing the preferred expression vector of this 
invention, was deposited pursuant to Budapest Treaty 
requirements with the American Type Culture Collection 
(ATCC), Rockville, MD, as described herein. Lambdoid
genomic DNA, λfoo, comprising a preferred expression
vector that produces a fusion polypeptide of this
invention was deposited pursuant to Budapest Treaty
requirements with the American Type Culture Collection
(ATCC), Rockville, MD, as described herein.

1. Multimer Configurations

A particularly preferred cistron that 
encodes a lambda pV tail polypeptide, a preferred 
linker, and a second translatable sequence that 
encodes the linker together with a sequence adapted 
for ligation of an insert DNA and the α-peptide of
β-gal is shown in SEQ ID NO 5. This cistron includes
two additional overlapping cistrons that encode a
lambda truncated tail polypeptide produced by
termination at the non-suppressed suppressor codon
(from base number 1 to 545 of SEQ ID NO 5) and a
sequence adapted for ligation of an insert
polynucleotide that defines a third translatable
sequence that encodes a preselected polypeptide (from
base number 643 to 728 of SEQ ID NO 5).

The invention can be practiced so as to produce a 
monomer fusion protein, but is particularly suited for 
expression of multimers: homomultimers and 
heteromultimers which assemble in the cytoplasm. To
that end, the cistrons which express the subunits can
be located within the fusion protein cistron, i.e.,
overlapping, or can be a second non-overlapping
cistron. In this latter configuration, the monomer 
can be a polypeptide different from the fusion
protein's preselected polypeptide, and can be used to
form heteromultimers. The cistron for expression of 
the different polypeptides can be located in cis or 
trans relative to the fusion protein cistron. 

I. Libraries of Phage

The present invention contemplates a library 
of DNA molecules that each encode a fusion protein of 
this invention where the library is in the form of a 
population of different lambdoid phage particles each
containing a different rDNA molecule of this
invention. By different rDNA molecule is meant rDNA 
molecules differing in nucleotide base sequence 
encoding the preselected polypeptide of the fusion 
protein. 

Thus, a phage library is a population of lambdoid
phage, preferably λ, φ80, or φ81 lambdoid phage, each
phage having packaged inside the particle a rDNA
expression vector of this invention. A preferred
library is comprised of phage particles containing DNA
molecules that encode at least 10^6, preferably 10^7
and more preferably 10^8 to 10^9 different fusion
proteins of this invention. By different fusion
proteins is meant fusion proteins differing in amino
acid residue sequence. Where the packaged expression
vector encodes polypeptides of an autogenously
assembling receptor, e.g., β-gal or BPA monomers that
assemble to form a biologically active enzyme or
lectin, respectively, the library can also be
characterized as containing or expressing a
multiplicity of receptor specificities. Thus,
preferred libraries express at least 10^5, preferably
at least 10^6 and more preferably at least 10^7
different receptors, such as different enzymes,
lectins, and the like.

As described herein, a particular advantage of a
lambdoid phage in the present invention is that the 
DNA molecule present in the phage particle and 
encoding the multimeric receptor can be segregated 
from other DNA molecules present in the library on the
basis of the presence of the particular expressed
fusion protein on the surface of the phage particle. 

Isolation (segregation) of a DNA molecule 
encoding one or more members of a multimeric receptor 
is conducted by segregation of the lambdoid phage
particle containing the gene or genes of interest away
from the population of other phage particles 
comprising the library. Segregation of phage 
particles involves the physical separation and 
propagation of individual phage particles away from
other particles in the library. Methods for physical
separation of lambdoid phage particles to produce
individual particles, and the propagation of the
individual particles to form populations of progeny
phage derived from the individual segregated particle
are well known in the lambdoid phage arts.

A preferred separation method involves the 
identification of the expressed multimer on the 
surface of the phage particle by means of substrate- 
or ligand-binding specificity between the phage 
particle and a preselected substrate or ligand.

Exemplary and preferred is the use of "panning" 
methods whereby a suspension of phage particles is 
contacted with a solid phase substrate or ligand and 
allowed to specifically bind. After binding, non-
bound particles are washed off the solid phase, and
the bound phage particles are those that contain
substrate- or ligand-specific multimeric receptor
(homotetramer) on their surface. The bound particles
can then be recovered by elution of the bound particle
from the solid phase, typically by the use of a
protease which provides a means for the separation of
the phage particle containing the recombinant genome
and the multimeric receptor. An alternative method
for recovering the bound particles by elution of the
bound particle from the solid phase is by the use of
aqueous solvents having high ionic strength sufficient
to disrupt the receptor-ligand or receptor-substrate
binding interaction.
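
The enrichment obtained by repeated rounds of panning can be
sketched with a simple retention model. Everything numerical
below is an assumption made for illustration: the starting
fraction of binders and the per-round retention probabilities
are hypothetical values, not measurements reported in this
application, and the model ignores amplification bias between
rounds.

```python
def binder_fraction(start_fraction, keep_binder, keep_nonbinder, rounds):
    """Fraction of the recovered population that binds the ligand.

    Each round retains a fixed fraction of binders and a (much smaller)
    fixed fraction of non-binders that survive the wash.
    """
    binders = start_fraction * keep_binder ** rounds
    nonbinders = (1.0 - start_fraction) * keep_nonbinder ** rounds
    return binders / (binders + nonbinders)


# Hypothetical parameters: one binder per million particles, half of
# the binders recovered per round, 0.1% of non-binders carried over.
fractions = [binder_fraction(1e-6, 0.5, 0.001, r) for r in range(4)]
```

With these assumed values each round enriches binders by about
500-fold, so a rare binder comes to dominate the recovered
population within a few rounds.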

An alternate method for separating a phage
particle based on the ligand specificity of the
surface-expressed multimer from a population of 
particles is to precipitate the phage particles from 
the solution phase by crosslinkage with the ligand. 
Such methods are well known in the immunological arts. 
The use of the above particle segregation methods
provides a means for screening a population of
lambdoid phage particles present in a phage library of
this invention. As applied to a phage library,
screening can be utilized to enrich the library for
one or more particles that express a multimer having a
preselected substrate or ligand binding specificity. 
Where the library is designed to contain multiple 
species of multimers that all have some detectable 
measure of ligand binding activity, but differ in
protein structure, ligand binding affinity or avidity,
and the like, the screening methods can be utilized 
sequentially to first produce a library enriched for a 
preselected binding specificity, and then to produce a 
second library further enriched by further screening
comprising one or more isolated phage particles.
Methods for measuring ligand binding activities, and 
the like interactions between a ligand and a receptor 
are generally well known and are not discussed further 
as they are not essential features of the present
invention.

Thus, in one embodiment, a phage library is a 
population of particles enriched for a preselected 
ligand binding specificity. 

In another embodiment, a phage library comprises
a population of particles wherein each particle
contains at least one fusion protein of this invention 
on the surface of the phage particle. The actual 
amount of fusion protein present on the surface of a 
phage particle depends, in part, on the choice of coat
protein membrane anchor present in the fusion protein.
Where the anchor is derived from pD, there is the
potential for hundreds of fusion proteins on the
particle surface depending on the growth conditions
and other factors. Where the anchor is derived from
the more preferred pV, there are typically about 1 to
4 fusion proteins per phage particle. Preferably, a
phage particle in a library contains from about 1 to 
about 10 pV-derived fusion proteins on the surface of 
each particle, and more preferably about 1 to 5 fusion 
proteins per particle. Exemplary amounts of surface 
fusion protein are shown by the electron micrographs 
described in the Examples that describe particles 
having about 1 to 3 pV-derived fusion proteins per 
particle. 

In another embodiment, the present invention 
contemplates a population of phage particles that are 
the progeny of a single particle, and therefore all
express the same multimer on the particle surface.
Such a population of phage is homogeneous and
clonally derived, and therefore provides a source for
expressing large quantities of a particular fusion 
protein. 

1. Methods for Producing a Library
a. General Rationale 

The present invention provides a 
system for the simultaneous cloning and screening of 
preselected substrate- or ligand-binding specificities 
or preselected biological activity specificities from 
gene libraries using a single vector system. This 
system provides linkage of cloning and screening 
methodologies and has two requirements. First, that 
expression of the polypeptide chains of a multimeric 
protein complex in an in vitro expression host such as 
E. coli requires expression of the monomeric 
polypeptide chains in order that a functional 
multimeric receptor can assemble to produce a complex 
that has a desired biological function. Second, that 
screening of isolated members of the library for a 
preselected biological activity requires a means to 
correlate (a linkage) the activity of an expressed
molecule with a convenient means to isolate the gene
that encodes the member from the library. 

Linkage of expression and screening is 
accomplished by the combination of expression of a 
fusion polypeptide in the cytoplasm of a bacterial 
cell to allow assembly of a functional complex, and 
the targeting of a fusion polypeptide onto the tail of 
a lambdoid phage particle during phage assembly to 
allow for convenient screening of the library member 
of interest. Targeting to a phage particle is 
provided by the presence of a lambdoid phage tail 
protein matrix anchor domain (i.e., a pV-derived
matrix anchor domain) in a fusion polypeptide of this 
invention. 

The present invention contemplates in one 
embodiment a method for producing a library of DNA 
molecules, each DNA molecule comprising a cistron for 
expressing a fusion polypeptide on the surface of a 
lambdoid phage particle. The method comprises the 
steps of (a) forming a ligation admixture by combining 
in a ligation buffer (i) a population of preselected 
polypeptide encoding genes and (ii) a plurality of DNA 
expression vectors in linear form adapted to form 
fusion polypeptide- and pV polypeptide-expressing 
cistrons, and (b) subjecting the admixture to ligation 
conditions, for a time period sufficient for the 
library of genes to become operatively linked 
(ligated) to the plurality of vectors to form the 
library. 

In this embodiment, the library of polypeptide 
encoding genes are in the form of double-stranded (ds)
DNA and each member of the population has cohesive 
termini adapted for ligation. In addition, the 
plurality of DNA expression vectors are each linear 
DNA molecules having upstream and downstream cohesive
termini that are (a) adapted for receiving the 
polypeptide genes, and (b) operatively linked to 
respective upstream and downstream translatable DNA 
sequences. The upstream translatable DNA sequence 
encodes a lambdoid phage tail protein matrix anchor, 
preferably a portion of the pV polypeptide preceding a 
linker polypeptide as described herein for a 
polypeptide of this invention. The downstream 
translatable DNA sequence encodes a preselected 
polypeptide. The translatable DNA sequences are also 
operatively linked to respective upstream and 
downstream DNA expression control sequences as defined 
for a DNA expression vector described herein. 

The library so produced can be utilized for 
expression and screening of the assembled fusion 
polypeptide and monomeric polypeptides encoded by the 
resulting library of cistrons represented in the 
library by the expression and screening methods 
described herein. 

2. Production of Gene Libraries

A gene library is a collection of 
different genes, preferably polypeptide-encoding genes
(polypeptide genes), and may be isolated from natural
sources or can be generated artificially. Preferred
gene libraries comprise genes that code for the
members of a monomeric or multimeric molecule of
interest.

A gene library useful in practicing the present 
invention contains at least 10^3, preferably at least
10^4, more preferably at least 10^5, and most preferably
at least 10^7 different genes. Methods for evaluating
the diversity of a library of genes are well known to
one skilled in the art.



Thus, in one embodiment, the present invention 
contemplates a method of isolating genes coding for a 
multimeric protein complex having a preselected 
activity from a collection of genes. Additionally,
expressing the cloned genes and isolating the
resulting expressed multimeric protein complex is also
described. Preferably, the complex will be a
multimeric polypeptide capable of binding a ligand,
such as a cytoplasmic enzyme or lectin of interest.

Various well known methods can be employed to
produce a useful gene library. In some cases, it is
desirable to bias a library for a preselected
activity, such as by using as a source of nucleic acid
cells (source cells) at various stages of age, health,
stimulation, or disease state.

In addition, a library of diverse nucleotide 
sequences can be produced by directed or random 
mutagenesis of a preselected gene or population of
genes. Methods of mutagenesis are well known and are
not to be considered limiting to the present
invention.

Alternatively, it should be noted that the 
greater the genetic heterogeneity of the population of 
cells from which the nucleic acids are obtained, the
greater the diversity of the library that will be made
available for screening according to the method of the 
present invention. Thus, cells from different 
individuals and cells from different strains, races or 
species can be advantageously combined to increase the
heterogeneity (diversity) of a library.

Methods for preparing fragments of genomic DNA 
from which genes can be cloned as a diverse population 
are well known in the art. See, for example, Herrmann
et al., Methods in Enzymol., 152:180-183, 1987;
Frischauf, Methods in Enzymol., 152:183-190, 1987;
Frischauf, Methods in Enzymol., 152:190-199, 1987; and
DiLella et al., Methods in Enzymol., 152:199-212,
1987.

The desired polypeptide-encoding genes can be
isolated from either genomic material containing the
gene expressed or the messenger RNA (mRNA) which 
represents a transcript comprising the coding region 
of the gene. The difficulty in using the genomic DNA 
is in juxtaposing the sequences coding for
polypeptides, where the sequences are separated by
introns. The DNA fragment(s) containing the proper
exons must be isolated, the introns excised, and the 
exons then spliced in the proper order and in the 
proper orientation. For the most part, this will be 

15 difficult, so that the alternative technique employing 

the isolation of mRNA will be the method of choice 
because the coding regions of the gene have 
translocated to become adjacent, so that the sequence 
is continuous (free of introns} for the entire 

20 polypeptide-coding regions. 

Where mRNA is utilized, the cells will be lysed 
under RNase inhibiting conditions. In one embodiment, 
the first step is to isolate the total cellular mRNA. 
Poly A+ mRNA can then be selected by hybridization to 
an oligo-dT cellulose column. 

a. Preparation of Polynucleotide 
Primers 

The term "polynucleotide" as used 
herein in reference to primers, probes and nucleic 
acid fragments or segments to be synthesized by primer 
extension is defined as a molecule comprised of two or 
more deoxyribonucleotides or ribonucleotides, 
preferably more than 3. Its exact size will depend on 
many factors, which in turn depend on the ultimate 
conditions of use. 

The term "primer" as used herein refers to a 
polynucleotide whether purified from a nucleic acid 
restriction digest or produced synthetically, which is 
capable of acting as a point of initiation of nucleic 
acid synthesis when placed under conditions in which 
synthesis of a primer extension product which is 
complementary to a nucleic acid strand is induced, 
i.e., in the presence of nucleotides and an agent for 
polymerization such as DNA polymerase, reverse 
transcriptase and the like, and at a suitable 
temperature and pH. The primer is preferably single 
stranded for maximum efficiency, but may alternatively 
be in double stranded form. If double stranded, the 
primer is first treated to separate it from its 
complementary strand before being used to prepare 
extension products. Preferably, the primer is a 
polydeoxyribonucleotide. The primer must be 
sufficiently long to prime the synthesis of extension 
products in the presence of the agents for 
polymerization. The exact lengths of the primers will 
depend on many factors, including temperature and the 
source of primer. For example, depending on the 
complexity of the target sequence, a polynucleotide 
primer typically contains 15 to 25 or more 
nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally require 
cooler temperatures to form sufficiently stable hybrid 
complexes with template. 
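
The relationship between primer length and hybridization 
temperature noted above can be illustrated with a rough 
estimate. The following is a minimal sketch only, assuming 
the widely used Wallace rule (2°C per A or T, 4°C per G or 
C), which is an approximation best suited to short 
oligonucleotides and is not a method described in this 
specification. The primer sequences are hypothetical. 

```python
def wallace_tm(primer: str) -> int:
    """Rough melting temperature in deg C: 2*(A+T) + 4*(G+C).

    Assumption: the Wallace rule, an approximation for short primers.
    """
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical primers of 15 and 25 nucleotides.
short_primer = "ATGCATGCATGCATG"
long_primer = "ATGCATGCATGCATGCATGCATGCA"

print(len(short_primer), wallace_tm(short_primer))
print(len(long_primer), wallace_tm(long_primer))
```

Under this estimate the shorter primer has the lower value, 
consistent with the statement that short primer molecules 
generally require cooler temperatures. 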

The primers used herein are selected to be 
"substantially" complementary to the different strands 
of each specific sequence to be synthesized or 
amplified. This means that the primer must be 
sufficiently complementary to non-randomly hybridize 
with its respective template strand. Therefore, the 
primer sequence may or may not reflect the exact 
sequence of the template. For example, a 
non-complementary nucleotide fragment can be attached to 
the 5' end of the primer, with the remainder of the 
primer sequence being substantially complementary to 
the strand. Such non-complementary fragments 
typically code for an endonuclease restriction site. 
Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, 
provided the primer sequence has sufficient 
complementarity with the sequence of the strand to be 
synthesized or amplified to non-randomly hybridize 
therewith and thereby form an extension product under 
polynucleotide synthesizing conditions. 

Primers of the present invention may also contain 
a DNA-dependent RNA polymerase promoter sequence or 
its complement. See, for example, Krieg et al., Nucl. 
Acids Res., 12:7057-70, 1984; Studier et al., J. Mol. 
Biol., 189:113-130, 1986; and Molecular Cloning: A 
Laboratory Manual, Second Edition, Maniatis et al., 
eds., Cold Spring Harbor, NY, 1989. 

When a primer containing a DNA-dependent RNA 
polymerase promoter is used, the primer is hybridized 
to the polynucleotide strand to be amplified and the 
second polynucleotide strand of the DNA-dependent RNA 
polymerase promoter is completed using an inducing 
agent such as E. coli DNA polymerase I, or the Klenow 
fragment of E. coli DNA polymerase. The starting 
polynucleotide is amplified by alternating between the 
production of an RNA polynucleotide and DNA 
polynucleotide. 

Primers may also contain a template sequence or 
replication initiation site for an RNA-directed RNA 
polymerase. Typical RNA-directed RNA polymerases 
include the Qβ replicase described by Lizardi et al., 
Biotechnology, 6:1197-1202, 1988. RNA-directed 
polymerases produce large numbers of RNA strands from 
a small number of template RNA strands that contain a 
template sequence or replication initiation site. 
These polymerases typically give a one million-fold 
amplification of the template strand as has been 
described by Kramer et al., J. Mol. Biol., 89:719-736, 
1974. 

The polynucleotide primers can be prepared using 
any suitable method, such as, for example, the 
phosphotriester or phosphodiester methods; see Narang 
et al., Meth. Enzymol., 68:90, 1979; U.S. Patent No. 
4,356,270; and Brown et al., Meth. Enzymol., 68:109, 
1979. 

The choice of a primer's nucleotide sequence 
depends on factors such as the distance on the nucleic 
acid from the region coding for the desired receptor, 
its hybridization site on the nucleic acid relative to 
any second primer to be used, the number of genes it 
is to hybridize to, and the like. 

1) Primers for Producing Gene 
Libraries 

Gene libraries can be 
prepared prior to their utilization in the present 
invention. Library preparation is typically 
accomplished by primer extension, preferably by primer 
extension in a cDNA format. 

To produce a collection of DNA homologs by primer 
extension, the nucleotide sequence of a primer is 
selected to hybridize with a plurality of genes at a 
site defined by the addition of multiple poly(A)s in 
the 3' untranslated region of the mRNA (polyA tail) so 
that a nucleotide sequence coding for a functional 
(capable of binding) polypeptide is obtained. To 
hybridize to a plurality of different nucleic acid 
strands, the primer must be a substantial complement 
of a nucleotide sequence conserved among the different 
strands. Such sites include nucleotide sequences in 
the polyA tail and the like. 

Alternatively, if the repertoires of conserved 
receptor DNA homologs are to be produced by (PCR) 
amplification, two primers, i.e., a PCR primer pair, 
must be used for each coding strand of nucleic acid to 
be amplified. The first primer becomes part of the 
nonsense (minus or complementary) strand and 
hybridizes to a nucleotide sequence conserved among 
receptor (plus or coding) strands within the 
repertoire. To produce receptor-coding DNA homologs, 
first primers are therefore chosen to hybridize to 
(i.e. be complementary to) conserved regions within 
the gene. Second primers become part of the coding 
(plus) strand and hybridize to a nucleotide sequence 
conserved among minus strands. To produce the 
receptor-coding DNA homologs, second primers are 
therefore chosen to hybridize with a conserved 
nucleotide sequence at the 5' end of the 
receptor-coding gene. It should be noted that in the 
amplification of receptor-coding DNA homologs the 
conserved 5' nucleotide sequence of the second primer 
can be complementary to a sequence exogenously added 
using terminal deoxynucleotidyl transferase as 
described by Loh et al., Science, 243:217-220, 1989. 
One or both of the first and second primers can 
contain a nucleotide sequence defining an endonuclease 
recognition site. The site can be heterologous to the 
receptor gene being amplified and typically appears at 
or near the 5' end of the primer. 

When present, the restriction site-defining 
portion is typically located in a 5'-terminal 
non-priming portion of the primer. The restriction site 
defined by the first primer is typically chosen to be 
one recognized by a restriction enzyme that does not 
recognize the restriction site defined by the second 
primer, the objective being to be able to produce a 
DNA molecule having cohesive termini that are 
non-complementary to each other and thus allow directional 
insertion into a vector. 

In one embodiment, the present invention utilizes 
a set of polynucleotides that form primers having a 
priming region located at the 3'-terminus of the 
primer. The priming region is typically the 3'-most 
(3'-terminal) 15 to 30 nucleotide bases. The 
3'-terminal priming portion of each primer is capable of 
acting as a primer to catalyze nucleic acid synthesis, 
i.e., initiate a primer extension reaction off its 3' 
terminus. One or both of the primers can additionally 
contain a 5'-terminal (5'-most) non-priming portion, 
i.e., a region that does not participate in 
hybridization to the template. 
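
The primer layout described above, a 5'-terminal 
non-priming portion carrying a restriction site followed 
by a 3'-terminal priming region, can be sketched as 
follows. This is an illustration under stated assumptions 
and not a primer set of this invention: the coding strand 
is a hypothetical sequence, the two-base 5' clamp is 
hypothetical, and the XhoI (CTCGAG) and EcoRI (GAATTC) 
sites are chosen only as an example of two sites recognized 
by different enzymes. 

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def build_primer(clamp: str, site: str, priming: str) -> str:
    """Join a 5' non-priming portion (clamp + site) to a 3' priming region."""
    return (clamp + site + priming).upper()


def can_prime(primer: str, strand: str, priming_len: int) -> bool:
    """True if the 3'-terminal priming_len bases pair exactly with strand."""
    three_prime = primer[-priming_len:]
    return reverse_complement(three_prime) in strand.upper()


# Hypothetical coding (plus) strand, written 5' to 3'.
coding = "ATGGCTAGCAAGGAGCTGTTCACCGGTGTCGTGCCAATCCTGGTGGAACTG"
minus = reverse_complement(coding)
PRIMING_LEN = 15  # within the 15 to 30 base range given in the text

# First primer hybridizes to the coding strand (illustrative XhoI site).
first_primer = build_primer("GC", "CTCGAG",
                            reverse_complement(coding[-PRIMING_LEN:]))
# Second primer hybridizes to the minus strand (illustrative EcoRI site).
second_primer = build_primer("GC", "GAATTC", coding[:PRIMING_LEN])

print(first_primer, can_prime(first_primer, coding, PRIMING_LEN))
print(second_primer, can_prime(second_primer, minus, PRIMING_LEN))
```

Only the 3'-terminal bases are tested for pairing, which 
mirrors the statement that the 5'-terminal portion does 
not participate in hybridization to the template. 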

In PCR, each primer works in combination with a 
second primer to amplify a target nucleic acid 
sequence. The choice of PCR primer pairs for use in 
PCR is governed by considerations as discussed herein 
for producing gene libraries. That is, the primers 
have a nucleotide sequence that is complementary to a 
sequence in the gene to be amplified. 

2) Polymerase Chain Reaction to 
Produce Gene Libraries 

The strategy used for cloning 
a preselected gene will depend, as is well known in 
the art, on the type, complexity, and purity of the 
nucleic acids making up the genes. 

The preselected gene to be cloned is comprised of 
polynucleotide coding strands, such as mRNA and/or the 
sense strand of genomic DNA. If the gene is in the 
form of double stranded genomic DNA, it is usually 
first denatured, typically by melting, into single 
strands. A DNA-encoded gene is subjected to a PCR 
reaction by treating (contacting) the DNA with a PCR 
primer pair, each member of the pair having a 
preselected nucleotide sequence. The PCR primer pair 
is capable of initiating primer extension reactions by 
hybridizing to nucleotide sequences, preferably at 
least about 10 nucleotides in length and more 
preferably at least about 20 nucleotides in length. 
The first primer of a PCR primer pair is sometimes 
referred to herein as the "sense primer" because it 
hybridizes to the coding or sense strand of a nucleic 
acid. In addition, the second primer of a PCR primer 
pair is sometimes referred to herein as the "anti- 
sense primer" because it hybridizes to a non-coding or 
anti-sense strand of a nucleic acid, i.e., a strand 
complementary to a coding strand. 

The PCR reaction is performed by mixing the PCR 
primer pair, preferably a predetermined amount 
thereof, with the nucleic acids, preferably a 
predetermined amount thereof, in a PCR buffer to form 
a PCR reaction admixture. The admixture is maintained 
under polynucleotide synthesizing conditions for a 
time period, which is typically predetermined, 
sufficient for the formation of a PCR reaction 
product, thereby producing a plurality of different 
DNA homologs. 

A plurality of first primer and/or a plurality of 
second primers can be used in each amplification, 
e.g., one species of first primer can be paired with a 
number of different second primers to form several 
different primer pairs. Alternatively, an individual 
pair of first and second primers can be used. In any 
case, the amplification products of amplifications 
using the same or different combinations of first and 
second primers can be combined to increase the 
diversity of the gene library. 
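
The pairing of one species of first primer with several 
different second primers can be enumerated mechanically. 
The sketch below is illustrative only; the primer names 
are hypothetical labels and do not refer to any primer 
disclosed herein. 

```python
from itertools import product


def primer_pairs(first_primers, second_primers):
    """Return every (first, second) primer pairing as a list of tuples."""
    return list(product(first_primers, second_primers))


# Hypothetical primer labels, used only for illustration.
first_primers = ["first_A"]
second_primers = ["second_1", "second_2", "second_3"]

pairs = primer_pairs(first_primers, second_primers)
for first, second in pairs:
    print(first, "+", second)
```

Each pairing would correspond to one amplification whose 
products can then be combined with the others. 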

In another strategy, the object is to clone a 
preselected gene by providing a polynucleotide 
complement of the genes, such as the anti-sense strand 
of genomic dsDNA or the polynucleotide produced by 
subjecting mRNA to a reverse transcriptase reaction. 
Methods for producing such complements are well known 
in the art. 

The PCR reaction is performed using any suitable 
method. Generally it occurs in a buffered aqueous 
solution, i.e., a PCR buffer, preferably at a pH of 
7-9, most preferably about 8. Preferably, a molar 
excess (for genomic nucleic acid, usually about 10^6:1 
primer:template) of the primer is admixed to the 
buffer containing the template strand. A large molar 
excess is preferred to improve the efficiency of the 
process. 

The PCR buffer also contains the 
deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP 
and a polymerase, typically thermostable, all in 
adequate amounts for primer extension (polynucleotide 
synthesis) reaction. The resulting solution (PCR 
admixture) is heated to about 90°C - 100°C for about 1 
to 10 minutes, preferably from 1 to 4 minutes. After 
this heating period the solution is allowed to cool to 
48°C - 54°C, which is preferable for primer 
hybridization. The synthesis reaction may occur at 
from room temperature up to a temperature above which 
the polymerase (inducing agent) no longer functions 
efficiently. Thus, for example, if DNA polymerase is 
used as inducing agent, the temperature is generally 
no greater than about 40°C. An exemplary PCR buffer 
comprises the following: 50 mM KCl; 10 mM Tris-HCl, 
pH 8.3; 1.5 mM MgCl2; 0.001% (wt/vol) gelatin; 200 µM 
dATP; 200 µM dTTP; 200 µM dCTP; 200 µM dGTP; and 2.5 
units Thermus aquaticus DNA polymerase I (U.S. Patent 
No. 4,889,818) per 100 microliters of buffer. 
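
As a worked illustration of the exemplary buffer, the 
volume of each stock solution needed for a 100 microliter 
reaction follows from the dilution relation 
C(stock) x V(stock) = C(final) x V(total). The sketch 
below is a minimal example; the final concentrations are 
those recited above, whereas the stock concentrations are 
assumptions chosen for illustration and are not specified 
in this description. 

```python
def stock_volume_ul(final_mM: float, stock_mM: float, total_ul: float) -> float:
    """Volume of stock (microliters) giving final_mM in total_ul."""
    if stock_mM <= 0 or total_ul <= 0:
        raise ValueError("stock concentration and total volume must be positive")
    if final_mM > stock_mM:
        raise ValueError("final concentration cannot exceed stock concentration")
    return final_mM * total_ul / stock_mM


TOTAL_UL = 100.0

# name: (final concentration in mM from the text, ASSUMED stock in mM)
components = {
    "KCl": (50.0, 1000.0),
    "Tris-HCl pH 8.3": (10.0, 1000.0),
    "MgCl2": (1.5, 25.0),
    "dATP": (0.2, 10.0),
    "dTTP": (0.2, 10.0),
    "dCTP": (0.2, 10.0),
    "dGTP": (0.2, 10.0),
}

volumes = {
    name: stock_volume_ul(final, stock, TOTAL_UL)
    for name, (final, stock) in components.items()
}

for name, vol in volumes.items():
    print(f"{name}: {vol:.2f} ul")
```

Gelatin, polymerase, primers and template are omitted from 
the sketch; the remaining volume would be made up with 
water. 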

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of 
primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for 
example, E. coli DNA polymerase I, Klenow fragment of 
E. coli DNA polymerase I, T4 DNA polymerase, other 
available DNA polymerases, reverse transcriptase, and 
other enzymes, including heat-stable enzymes, which 
will facilitate combination of the nucleotides in the 
proper manner to form the primer extension products 
which are complementary to each nucleic acid strand. 
Generally, the synthesis will be initiated at the 3' 
end of each primer and proceed in the 5' direction 
along the template strand, until synthesis terminates, 
producing molecules of different lengths. There may 
be inducing agents, however, which initiate synthesis 
at the 5' end and proceed in the above direction, 
using the same process as described above. 

The inducing agent also may be a compound or 
system which will function to accomplish the synthesis 
of RNA primer extension products, including enzymes. 
In preferred embodiments, the inducing agent may be a 
DNA-dependent RNA polymerase such as T7 RNA 
polymerase, T3 RNA polymerase or SP6 RNA polymerase. 
These polymerases produce a complementary RNA 
polynucleotide. The high turnover rate of the RNA 
polymerase amplifies the starting polynucleotide as 
has been described by Chamberlin et al., The Enzymes, 
ed. P. Boyer, pp. 87-108, Academic Press, New York, 
1982. Another advantage of T7 RNA polymerase is that 
mutations can be introduced into the polynucleotide 
synthesis by replacing a portion of cDNA with one or 
more mutagenic oligodeoxynucleotides (polynucleotides) 
and transcribing the partially-mismatched template 
directly as has been previously described by Joyce et 
al., Nuc. Acid Res., 17:711-722, 1989. Amplification 
systems based on transcription have been described by 
Gingeras et al., in PCR Protocols, A Guide to Methods 
and Applications, pp 245-252, Academic Press, Inc., 
San Diego, CA, 1990. 

If the inducing agent is a DNA-dependent RNA 
polymerase and therefore incorporates ribonucleotide 
triphosphates, sufficient amounts of ATP, CTP, GTP and 
UTP are admixed to the primer extension reaction 
admixture and the resulting solution is treated as 
described above. 

The newly synthesized strand and its 
complementary nucleic acid strand form a 
double-stranded molecule which can be used in the succeeding 
steps of the process. 

The first and/or second PCR reactions discussed 
above can advantageously be used to incorporate into 
the gene a preselected epitope useful in 
immunologically detecting and/or isolating the protein 
expressed by the cloned gene. This is accomplished by 
utilizing a first and/or second polynucleotide 
synthesis primer or expression vector to incorporate a 
predetermined amino acid residue sequence into the 
amino acid residue sequence of the encoded protein. 

After producing DNA homologs for a plurality of 
different genes, the DNA molecules are typically 
further amplified. While the DNA molecules can be 
amplified by classic techniques such as incorporation 
into an autonomously replicating vector, it is 
preferred to first amplify the molecules by subjecting 
them to a polymerase chain reaction (PCR) prior to 
inserting them into a vector. PCR is typically 
carried out by thermocycling, i.e., repeatedly 
increasing and decreasing the temperature of a PCR 
reaction admixture within a temperature range whose 
lower limit is about 10°C to about 40°C and whose 
upper limit is about 90°C to about 100°C. The 
increasing and decreasing can be continuous, but is 
preferably phasic with time periods of relative 
temperature stability at each of temperatures favoring 
polynucleotide synthesis, denaturation and 
hybridization. 

PCR amplification methods are described in detail 
in U.S. Patent Nos. 4,683,195, 4,683,202, 4,800,159, 
and 4,965,188, and at least in several texts including 
"PCR Technology: Principles and Applications for DNA 
Amplification", H. Erlich, ed., Stockton Press, New 
York, 1989; and "PCR Protocols: A Guide to Methods 
and Applications", Innis et al., eds., Academic Press, 
San Diego, California, 1990. 

In preferred embodiments, only one pair of first 
and second primers is used per amplification reaction. 
The amplification reaction products obtained from a 
plurality of different amplifications, each using a 
plurality of different primer pairs, are then 
combined. 

However, the present invention also contemplates 
DNA homolog production via co-amplification (using two 
pairs of primers), and multiplex amplification (using 
up to about 8, 9 or 10 primer pairs). 

In preferred embodiments, the PCR process is used 
not only to produce a library of DNA molecules, but 
also to induce mutations within the library or to 
create diversity from a single parental clone and 
thereby provide a library having a greater 
heterogeneity. First, it should be noted that the PCR 
process itself is inherently mutagenic due to a 
variety of factors well known in the art. Second, in 
addition to the mutation inducing variations described 
in the above referenced U.S. Patent No. 4,683,195, 
other mutation inducing PCR variations can be 
employed. For example, the PCR reaction admixture 
can be formed with different amounts of one or more of 
the nucleotides to be incorporated into the extension 
product. Under such conditions, the PCR reaction 
proceeds to produce nucleotide substitutions within 
the extension product as a result of the scarcity of a 
particular base. Similarly, approximately equal molar 
amounts of the nucleotides can be incorporated into 
the initial PCR reaction admixture in an amount to 
efficiently perform X number of cycles, and the 
admixture then cycled through a number of cycles in 
excess of X, such as, for instance, 2X. 
Alternatively, mutations can be induced during the PCR 
reaction by incorporating into the reaction admixture 
nucleotide derivatives such as inosine, not normally 
found in the nucleic acids being amplified. During 
subsequent in vivo DNA synthesis and replication of 
the nucleic acids in a host cell, the nucleotide 
derivative will be replaced with a substitute 
nucleotide thereby inducing a mutation. 

b. Production of Gene Libraries Using 
cDNA Synthesis 
1) Preparation of RNA 

Total cellular RNA is 
prepared from appropriate cells using the RNA 
preparation methods described by Chomczynski et al., 
Anal. Biochem., 162:156-159, 1987 and using the RNA 
isolation kit (Stratagene) according to the 
manufacturer's instructions. Briefly, immediately 
after isolation of the cells, the cells are 
homogenized in 10 ml of a denaturing solution 
containing 4.0 M guanidine isothiocyanate, 0.25 M sodium 
citrate at pH 7.0, and 0.1 M beta-mercaptoethanol 
using a glass homogenizer. One ml of sodium acetate 
at a concentration of 2 M at pH 4.0 is admixed with 
the homogenized spleen. One ml of phenol that has 
been previously saturated with H2O is also admixed to 
the denaturing solution containing the homogenized 
spleen. Two ml of a chloroform:isoamyl alcohol (24:1 
v/v) mixture is added to this homogenate. The 
homogenate is mixed vigorously for ten seconds and 
maintained on ice for 15 minutes. The homogenate is 
then transferred to a thick-walled 50 ml polypropylene 
centrifuge tube (Fisher Scientific Company, Pittsburgh, 
PA). The solution is centrifuged at 10,000 x g for 20 
minutes at 4°C. The upper RNA-containing aqueous 
layer is transferred to a fresh 50 ml polypropylene 
centrifuge tube and mixed with an equal volume of 
isopropyl alcohol. This solution is maintained at 
-20°C for at least one hour to precipitate the RNA. 
The solution containing the precipitated RNA is 
centrifuged at 10,000 x g for twenty minutes at 4°C. 
The pelleted total cellular RNA is collected and 
dissolved in 3 ml of the denaturing solution described 
above. Three ml of isopropyl alcohol is added to the 
re-suspended total cellular RNA and vigorously mixed. 
This solution is maintained at -20°C for at least 1 
hour to precipitate the RNA. The solution containing 
the precipitated RNA is centrifuged at 10,000 x g for 
ten minutes at 4°C. The pelleted RNA is washed once 
with a solution containing 75% ethanol. The pelleted 
RNA is dried under vacuum for 15 minutes and then 
re-suspended in diethyl pyrocarbonate (DEPC) treated 
H2O (DEPC-H2O). 

Messenger RNA (mRNA) enriched for sequences 
containing long poly A tracts is prepared from the 
total cellular RNA using methods described in 
Molecular Cloning: A Laboratory Manual, Maniatis et 
al., eds., Cold Spring Harbor, NY, 1982. Briefly, one 
half of the total RNA isolated is re-suspended in one 
ml of DEPC-H2O and maintained at 65°C for five 
minutes. One ml of 2x high salt loading buffer 
consisting of 100 mM Tris-HCl (Tris[hydroxymethyl] 
aminomethane hydrochloride), 1 M sodium chloride 
(NaCl), 2.0 mM disodium ethylene diamine tetra-acetic 
acid (EDTA) at pH 7.5, and 0.2% sodium dodecyl sulfate 
(SDS) is added to the re-suspended RNA and the mixture 
allowed to cool to room temperature. The mixture is 
then applied to an oligo-dT (Collaborative Research 
Type 2 or Type 3) column that has been previously 
prepared by washing the oligo-dT with a solution 
containing 0.1 M sodium hydroxide and 5 mM EDTA and 
then equilibrating the column with DEPC-H2O. The 
eluate is collected in a sterile polypropylene tube 
and reapplied to the same column after heating the 
eluate for 5 minutes at 65°C. The oligo-dT column is 
then washed with 2 ml of high salt loading buffer 
consisting of 50 mM Tris-HCl, pH 7.5, 500 mM sodium 
chloride, 1 mM EDTA at pH 7.5 and 0.1% SDS. The 
oligo-dT column is then washed with 2 ml of 1X medium salt 
buffer consisting of 50 mM Tris-HCl, pH 7.5, 100 mM 
sodium chloride, 1 mM EDTA and 0.1% SDS. The 
messenger RNA is eluted 
from the oligo-dT column with 1 ml of buffer 
consisting of 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, at pH 
7.5, and 0.05% SDS. The messenger RNA is purified by 
extracting this solution with phenol/chloroform 
followed by a single extraction with 100% chloroform. 
The messenger RNA is concentrated by ethanol 
precipitation and re-suspended in DEPC-H2O. 

The messenger RNA (mRNA) isolated by the above 
process contains a plurality of different 
polynucleotides, i.e., greater than about 10^4 
different genes. Thus, the mRNA population represents 
a repertoire of genes. 

2) Preparation of DNA Homologs 
In preparation for PCR 
amplification, mRNA prepared above is used as a 
template for cDNA synthesis by a primer extension 
reaction. In a typical 50 µl transcription reaction, 
5-10 µg of mRNA in water is first hybridized 
(annealed) with 500 ng (50.0 pmol) of a 3' primer, at 
65°C for five minutes. Suitable 3' primers comprise 
primers designed to specifically anneal to a single 
mRNA transcript or those primers designed to anneal to 
multiple transcripts, e.g. polyT primers. 
Subsequently, the mixture is adjusted to 1.5 mM dATP, 
dCTP, dGTP and dTTP, 40 mM Tris-HCl, pH 8.0, 8 mM 
MgCl2, 50 mM NaCl, and 2 mM spermidine. Moloney 
Murine Leukemia Virus reverse transcriptase 
(Stratagene), 26 units, is added and the solution is 
maintained for 1 hour at 37°C. 
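
The correspondence between primer mass and molar amount 
stated above (500 ng, 50.0 pmol) can be checked with a 
simple conversion. The sketch below is illustrative only. 
It assumes an average mass of about 330 g/mol per 
nucleotide of single-stranded DNA and a primer length of 
30 nucleotides; neither value is stated in this 
description, and the length is inferred only because it 
reproduces the recited figures. 

```python
AVG_NT_MASS = 330.0  # g/mol per nucleotide; approximate, assumed value


def ng_to_pmol(mass_ng: float, length_nt: int) -> float:
    """Convert a mass of single-stranded oligonucleotide to picomoles."""
    if mass_ng < 0 or length_nt <= 0:
        raise ValueError("mass must be non-negative and length positive")
    # ng / (g/mol) gives nmol; multiply by 1000 to obtain pmol.
    return mass_ng * 1000.0 / (length_nt * AVG_NT_MASS)


# Assumed 30-nucleotide primer, 500 ng as recited in the text.
primer_pmol = ng_to_pmol(500.0, 30)
print(f"{primer_pmol:.1f} pmol")
```

Under these assumptions the result is close to the 50.0 
pmol recited for 500 ng of primer. 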

PCR amplification is performed in a 100 µl 
reaction containing the products of the reverse 
transcription reaction (approximately 5 µg of the 
cDNA/RNA hybrid), 300 ng of 3' primer, 300 ng of the 
5' primer, 200 mM of a mixture of dNTP's, 50 mM KCl, 
10 mM Tris-HCl pH 8.3, 1.5 mM MgCl2, 0.1% gelatin and 
2.5 units of Thermus aquaticus (Taq) DNA polymerase. 
The reaction mixture is overlaid with mineral oil and 
subjected to 40 cycles of amplification. Each 
amplification cycle includes denaturation at 92°C for 
1 minute, annealing at 52°C for 2 minutes and 
polynucleotide synthesis by primer extension 
(elongation) at 72°C for 1.5 minutes. The amplified 
DNA homolog containing samples are then extracted 
twice with phenol/chloroform, once with chloroform, 
ethanol precipitated and are stored at -70°C in 10 mM 
Tris-HCl, pH 7.5, and 1 mM EDTA. 
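
The cycling program recited above can be summarized 
numerically. The following sketch is illustrative: the 
step times and cycle count are those given in the text, 
while the per-cycle efficiency parameter and the 
(1 + efficiency)^n model of amplification are assumptions 
introduced here. Temperature ramp times are ignored. 

```python
def cycle_minutes(denature_min: float, anneal_min: float,
                  extend_min: float) -> float:
    """Hold time of one cycle, excluding temperature ramps."""
    return denature_min + anneal_min + extend_min


def total_minutes(cycles: int, denature_min: float, anneal_min: float,
                  extend_min: float) -> float:
    """Total hold time for the whole program."""
    return cycles * cycle_minutes(denature_min, anneal_min, extend_min)


def fold_amplification(cycles: int, efficiency: float = 1.0) -> float:
    """Idealized fold increase, (1 + efficiency) ** cycles (assumed model)."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return (1.0 + efficiency) ** cycles


# 40 cycles: 92 C for 1 min, 52 C for 2 min, 72 C for 1.5 min.
run_minutes = total_minutes(40, 1.0, 2.0, 1.5)
ideal_fold = fold_amplification(40)

print(f"hold time: {run_minutes:.0f} min")
print(f"upper bound on amplification: {ideal_fold:.3e}-fold")
```

The idealized figure is an upper bound; in practice 
reagents become limiting well before it is reached. 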

Using 5' and 3' primers, efficient DNA homolog 
synthesis and amplification from the mRNA is achieved 
as shown by agarose gel electrophoresis. Following 
the above procedures, a gene library is constructed 
from the products of PCR amplifications. Equal 
portions of the products from different primer 
extension reactions may be mixed and the mixed product 
is used to generate a library of DNA homolog- 
containing vectors. 

3. Linear DNA Expression Vectors 

A DNA expression vector for use in a 
method of the invention for producing a library of DNA 
molecules is a linearized DNA molecule as described 
herein having two (upstream and downstream) cohesive 
termini adapted for ligation to a polypeptide gene. 

A linear DNA expression vector is typically 
prepared by restriction endonuclease digestion of a 
circular or linear DNA expression vector of this 
invention to cut at two preselected restriction sites 
within the sequence of nucleotides of the vector 
adapted for ligation to produce a linear DNA molecule 
having the required cohesive termini that are adapted 
for ligation. A first terminus of the vector is 
complementary to both termini of the insert and to the 
second terminus of the vector. 

A preferred DNA expression vector for use in a 
method of the invention for producing a library of DNA 
molecules is a linearized DNA molecule as described 
herein having two (upstream and downstream) cohesive 
termini adapted for ligation to a polypeptide gene. 

A preferred linear DNA expression vector is 
typically prepared by restriction endonuclease 
digestion of a circular or linear DNA expression 
vector of this invention to cut at two preselected 
restriction sites within the sequence of nucleotides 
of the vector adapted for ligation to produce a linear 
DNA molecule having the required cohesive termini that 
are adapted for ligation. 

4. Ligation Reactions to Produce Gene 
Libraries 

To prepare an expression library of 
sequences, DNA homologs are prepared as described 
herein. The resulting PCR amplified products (2.5 
µg/30 µl of 150 mM NaCl, 8 mM Tris-HCl, pH 7.5, 6 mM 
MgSO4, 1 mM DTT, 200 µg/ml BSA) are digested at 37°C 
with appropriate restriction enzymes such as those 
contained within the multiple cloning site of the 
lambda vectors of this invention. In cloning 
experiments which require a mixture of the products 
of the amplification reactions, equal volumes (50 µl, 
1-10 µg concentration) of each reaction mixture are 
combined after amplification but before restriction 
digestion. The homologs are purified on a 1% agarose 
gel using the standard electro-elution technique 
described in Molecular Cloning: A Laboratory Manual, 
Maniatis et al., eds., Cold Spring Harbor, NY, 1982. 
After gel electrophoresis of the digested PCR 
amplified mRNA, the region of the gel containing DNA 
fragments of the desired base pairs is excised, 
electro-eluted into a dialysis membrane, ethanol 
precipitated and re-suspended in a TE solution 
containing 10 mM Tris-HCl, pH 7.5 and 1 mM EDTA to a 
final concentration of 50 ng/µl. The resulting DNA 
homologs represent a repertoire of polypeptide genes 
having cohesive termini adapted for directional 
ligation to the vector λfoo. These prepared DNA 
homologs are then directly inserted by directional 
ligation into linearized λfoo expression vector 
prepared as described below. 

The λfoo expression DNA vector is prepared for 
inserting a DNA homolog by admixing 100 µg of this DNA 
to a solution containing 250 units each of appropriate 
restriction endonucleases and a buffer recommended by 
the manufacturer. This solution is maintained at 37°C 
for 1.5 hours. The solution is heated at 65°C for 15 
minutes to inactivate the restriction endonucleases. 
The solution is chilled to 30°C and 25 units of 
heat-killable (HK) phosphatase (Epicenter, Madison, WI) and 
CaCl2 are admixed to it according to the manufacturer's 
specifications. This solution is maintained at 30°C 
for 1 hour. The DNA is purified by extracting the 
solution with a mixture of phenol and chloroform 
followed by ethanol precipitation. The λfoo 
expression vector is now ready for ligation to the DNA 
homologs prepared as described herein. These prepared 
DNA homologs are then directly inserted into the 
restriction digested λfoo expression vector 
prepared above by ligating 3 moles of DNA homolog 
inserts with each mole of the λfoo expression vector 
overnight at 5°C. Approximately 3.0 x 10^5 plaque 
forming units are obtained after packaging the DNA 
with Gigapack® II Gold (Stratagene) of which >10% are 
recombinants. The ligation mixture containing the DNA 
homologs is packaged according to the manufacturer's 
specifications using Gigapack® II Gold Packaging Extract 
(Stratagene). The resulting λfoo expression libraries 
are then transformed into XL1-Blue cells. 
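
The 3:1 molar ratio of insert to vector recited above can 
be converted to masses when the lengths of the two 
molecules are known. The sketch below is a minimal 
illustration. It assumes that vector and insert are both 
double-stranded DNA of the same average mass per base 
pair, so that the mass ratio scales with the length 
ratio; the vector length, insert length and vector mass 
used in the example are hypothetical values and are not 
taken from this description. 

```python
def insert_mass_ng(vector_ng: float, vector_bp: int, insert_bp: int,
                   molar_ratio: float = 3.0) -> float:
    """Mass of insert giving molar_ratio moles of insert per mole of vector."""
    if min(vector_ng, vector_bp, insert_bp, molar_ratio) <= 0:
        raise ValueError("all arguments must be positive")
    return molar_ratio * vector_ng * insert_bp / vector_bp


# Hypothetical example: 1000 ng of a 48,000 bp vector, 1,000 bp insert.
needed_ng = insert_mass_ng(vector_ng=1000.0, vector_bp=48000, insert_bp=1000)
print(f"{needed_ng:.1f} ng of insert for a 3:1 molar ratio")
```

Because the insert is much shorter than a lambdoid vector, 
a small mass of insert already supplies a molar excess. 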

In preparing a library of DNA molecules of this 
invention, a ligation admixture is prepared as 
described above, and the admixture is subjected to 
ligation conditions for a time period sufficient for 
the admixed collection of polypeptide genes to ligate 
(become operatively linked) to the plurality of DNA 
expression vectors to form the library. 
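
The arithmetic behind the ligation admixture can be sketched as follows. This is only an illustration, assuming that insert and vector are both double-stranded DNA so that the mass per base pair cancels out of the molar ratio. The function names and the example sizes (a 48,500 bp vector and a 750 bp insert) are hypothetical and are not taken from this specification; only the 3:1 molar ratio, the 3.0 x 10^5 plaque forming units and the 10% recombinant fraction come from the text.

```python
def insert_mass_ng(vector_mass_ng, vector_bp, insert_bp, molar_ratio=3.0):
    """Mass of insert (ng) giving `molar_ratio` moles of insert per mole of vector.

    Moles of dsDNA are proportional to mass / length, so the average mass
    per base pair cancels and only the two lengths are needed.
    """
    if min(vector_mass_ng, vector_bp, insert_bp, molar_ratio) <= 0:
        raise ValueError("all arguments must be positive")
    return molar_ratio * vector_mass_ng * insert_bp / vector_bp


def recombinant_pfu(total_pfu, recombinant_fraction):
    """Number of recombinant plaque forming units in a packaged library."""
    if not 0.0 <= recombinant_fraction <= 1.0:
        raise ValueError("recombinant_fraction must lie between 0 and 1")
    return total_pfu * recombinant_fraction
```

For instance, with the hypothetical sizes above, `insert_mass_ng(1000, 48500, 750)` gives roughly 46 ng of insert per microgram of vector, and `recombinant_pfu(3.0e5, 0.10)` gives the lower bound of about 3 x 10^4 recombinants implied by the figures in the text.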

Ligation conditions are conditions selected to
favor a ligation reaction wherein a phosphodiester
bond is formed between adjacent 3' hydroxyl and 5'
phosphoryl termini of DNA. The ligation reaction is
preferably catalyzed by the enzyme T4 DNA ligase.
Ligation conditions can vary in time, temperature,
concentration of buffers, quantities of DNA molecules
to be ligated, and amounts of ligase, as is well
known. Preferred ligation conditions involve
maintaining the ligation admixture at 4°C to 25°C for
1 to 24 hours in the presence of 1 to 10 units of T4
DNA ligase per milliliter (ml) and about 0.1 to 2
micrograms (μg) of DNA. Ligation buffer in a ligation
admixture typically contains 0.5 M Tris-HCl (pH 7.4),
0.01 M MgCl2, 0.01 M dithiothreitol, 1 mM spermidine,
1 mM ATP and 0.1 mg/ml bovine serum albumin (BSA).
Other ligation buffers can also be used.
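
The preferred ranges stated above can be captured in a small check, shown here as a sketch only. The class and function names are hypothetical; the numeric ranges are the ones given in the preceding paragraph, and conditions outside them are merely not "preferred" rather than excluded.

```python
from dataclasses import dataclass

# Preferred ranges taken from the paragraph above (inclusive bounds).
PREFERRED = {
    "temp_c": (4.0, 25.0),           # degrees Celsius
    "hours": (1.0, 24.0),            # incubation time
    "ligase_u_per_ml": (1.0, 10.0),  # units of T4 DNA ligase per ml
    "dna_ug": (0.1, 2.0),            # micrograms of DNA
}


@dataclass
class LigationConditions:
    temp_c: float
    hours: float
    ligase_u_per_ml: float
    dna_ug: float


def outside_preferred(cond):
    """Return the names of the parameters that fall outside the preferred ranges."""
    problems = []
    for name, (low, high) in PREFERRED.items():
        value = getattr(cond, name)
        if not low <= value <= high:
            problems.append(name)
    return problems
```

An empty list from `outside_preferred` means every parameter lies within the preferred ranges. An overnight ligation at 5°C, for example, passes the temperature check.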

Exemplary ligation reactions are described in the 
Examples. 

In a particularly preferred embodiment, the 
present invention contemplates methods for the 
preparation of a library of DNA molecules having one 
or more cistrons for expressing a fusion protein of 
this invention. In preferred embodiments, multiple 
cistrons are operatively linked at relative locations 
on the DNA molecule such that the cistrons are under 
the transcriptional control of a single promoter.
Each multicistronic molecule is capable of expressing
first, second, and optionally third polypeptides from 
first, second, and optional third overlapping 
cistrons, respectively, that can assemble to form, in 
a suitable host, a multimeric receptor on the surface 
of a lambdoid phage particle. 

The method for producing a library of 
multicistronic DNA molecules comprises the steps of: 

(a) Forming a ligation admixture by 
combining in a ligation buffer: 

(i) polypeptide genes in the form 
of dsDNA, each having cohesive termini adapted for 
ligation, and 

(ii) a plurality of DNA expression 
vectors in linear form, each having upstream and 
downstream first cohesive termini that are (a) adapted 
for receiving the polypeptide genes in a common 
reading frame, and (b) operatively linked to 
respective upstream translatable DNA sequences. The 
upstream translatable DNA sequence encodes a lambdoid 
phage matrix anchor protein and linker polypeptide, 
and translatable DNA sequences are operatively linked 
to respective upstream and downstream DNA expression 
control sequences.

(b) Subjecting the admixture to 
ligation conditions for a time period sufficient to 
operatively link the polypeptide genes to the vectors 
and produce a plurality of DNA molecules each having a 
first cistron for expressing the lambdoid phage matrix 
anchor polypeptide, a second cistron for expressing 
the fusion polypeptide, and an optional third cistron
for expressing the monomeric polypeptide. 

In preferred embodiments the lambdoid phage 
matrix anchor protein is a portion of the lambda pV 
tail protein. Also preferred is the use of a linker
peptide that is derived from C. fimi endo-β-1,4-
glucanase as described herein and contains sequences
for the expression of the downstream inserted DNA
sequences.

DNA expression vectors useful for practicing the
above method are the expression vectors described in
greater detail before.

In practicing the method of producing a library
of multicistronic DNA molecules, it is preferred that
the upstream and downstream first cohesive termini do
not have the same nucleotide sequences as the upstream
and downstream second cohesive termini. In this
embodiment, the treating step (c) to linearize the
circular DNA molecules typically involves the use of
restriction endonucleases that are specific for
producing said second termini, but do not cleave the
circular DNA molecule at the sites that formed the
first termini. Exemplary and preferred first and
second termini are the termini defined by cleavage of
λfoo with HindIII and BamHI to form the upstream and
downstream first termini. In this embodiment, other
pairs of cohesive termini can be utilized at the
respective pairs of first and second termini, so long
as the termini are each distinct, non-complementary
termini. Exemplary are the termini found on the
vectors λV'sac, λV'mcs, λblue, and λfoo described
herein.
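
The requirement that the two termini be distinct and non-complementary can be illustrated with a short check on the single-stranded overhangs. This is a sketch with hypothetical function names. The overhang sequences are the well known 5' overhangs left by HindIII (A^AGCTT) and BamHI (G^GATCC), and are stated here as background rather than taken from this specification.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def can_anneal(overhang_a, overhang_b):
    """True if two 5' overhangs (each written 5'->3') can base-pair with each other."""
    return overhang_b.upper() == reverse_complement(overhang_a)


HINDIII_OVERHANG = "AGCT"  # left by cleavage at A^AGCTT
BAMHI_OVERHANG = "GATC"    # left by cleavage at G^GATCC
```

Each overhang can anneal to another copy of itself, but `can_anneal(HINDIII_OVERHANG, BAMHI_OVERHANG)` is false. An insert carrying one terminus of each kind can therefore enter a vector cut with both enzymes in only one orientation, which is what makes the ligation directional.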

Methods of treating the plurality of DNA
molecules under DNA cleavage conditions to form linear
DNA molecules with cohesive termini are generally well
known and depend on the nucleotide sequence to be
cleaved and the mechanism for cleavage. Preferred
treatments involve admixing the DNA molecules with a
restriction endonuclease specific for an endonuclease
recognition site at the desired cleavage location in
an amount sufficient for the restriction endonuclease
to cleave the DNA molecule. Buffers, cleavage
conditions, and substrate concentrations for
restriction endonuclease cleavage are well known and
depend on the particular enzyme utilized. Exemplary
restriction enzyme cleavage conditions are described
in the Examples.

5. Methods for Changing the Diversity of a
Library

The present invention provides methods 
for changing the diversity of a library of lambdoid 
phage of this invention. These methods generally 
increase the diversity of the library, thereby 
increasing the pool of possible biologically active 
complexes from which to screen for a desired activity. 
Alternatively, the methods can be directed at 
enriching for a class of ligand-binding or substrate- 
binding complexes. The class is typically defined by 
the ability to bind a particular ligand or family of 
ligands, or the ability to bind a particular substrate 
or family of substrates. 

a. Increasing Library Diversity by
Mutation

A particularly preferred method 
for increasing diversity is to alter the amino acid 
residue sequence of one or more polypeptides of the 
ligand- or substrate-binding complex encoded by the 
genome of a phage of this invention. Alterations can 
be conveniently introduced at the nucleic acid level 
by mutation of the nucleic acid. The method can be 
practiced on a single species of nucleic acid coding a 
polypeptide of this invention, or can be practiced on 
a library of nucleic acids present in a library of
phage of this invention.

Mutation of nucleic acid can be conducted by a
variety of means, but is most conveniently conducted
in a PCR reaction during a PCR process of the present
invention. PCR mutagenesis can be random or directed
to specific nucleotide sequences, as is generally well
known. Conducting PCR under conditions favorable to
random mutagenesis has been described previously, and
can involve the use of degenerate primers or
conditions of low stringency hybridization allowing
mismatch, such as by "error prone PCR". Similarly,
directed mutagenesis involves the use of PCR primers
designed to target a specific type of mutation into a
specific region of nucleotide sequence.

In one embodiment, the invention contemplates
increasing diversity of one or more ligand- or
substrate-binding complexes by PCR-directed mutation
of a domain present in a ligand- or substrate-binding
complex polypeptide of this invention.

Thus the invention contemplates a mutagenesis
method for altering the specificity of a cloned gene
present in a DNA vector of this invention. The method
provides directed mutagenesis in a preselected domain
of a gene which comprises subjecting a recombinant DNA
molecule (rDNA) containing the cloned gene having a
target domain to PCR conditions suitable for
amplifying a preselected region of the domain. In the
method, the rDNA molecule is subjected to PCR
conditions that include a PCR primer oligonucleotide
as described below constituting the first primer in a
PCR primer pair, as is well known, to produce an
amplified PCR product that is derived from the
preselected domain but that includes the nucleotide
sequences of the PCR primer. The second
oligonucleotide in the PCR amplifying conditions can
be any PCR primer derived from the gene to be
mutagenized, as described herein.

Preferred are methods using an oligonucleotide of
this invention as described below.

The 3' and 5' terminal nucleotide
sequences of a subject mutagenizing oligonucleotide
can vary in length as is well known, so long as the
length provides a stretch of nucleotides complementary
to the target sequences as to hybridize thereto. In
the case of the 3' terminal nucleotide sequence, it
must be of sufficient length and complementarity to
the target region located 3' to the region to be
mutagenized as to hybridize and provide a 3' hydroxyl
terminus for initiating a primer extension reaction.
In the case of the 5' terminal nucleotide sequence, it
must be of sufficient length and complementarity to
the target region located 5' to the region to be
mutagenized as to provide a means for hybridizing in a
PCR overlap extension reaction as described above to
assemble the complete polypeptide coding sequence.

Preferably, the 3' and 5' terminal
nucleotide sequences are each at least 6 nucleotides
in length, and can be up to 50 or more nucleotides in
length, although these lengths are unnecessary to
assure accurate and reproducible hybridization.
Preferred are lengths in the range of 12 to 30
nucleotides, and typically are about 18 nucleotides.

The nucleotide sequence located between the 3'
and 5' termini adapted for mutagenizing a domain can
be any nucleotide sequence, insofar as the novel
sequence will be incorporated by the above methods.
However, the present approach provides a means to
produce a large population of mutagenized domains in a
single PCR reaction by the use of a population of
redundant sequences defining randomized or nearly
randomized nucleotides in the region to be
mutagenized.
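
The layout of such a mutagenizing oligonucleotide, and the size of the population it defines, can be sketched as follows. This is an illustration under stated assumptions: the NNK codon scheme (N = A/C/G/T, K = G/T) is one common way of writing "nearly randomized" positions and is not specified in this text, and the function names and flank sequences used in the note below are hypothetical. The flank length limits (at least 6 nucleotides, preferably 12 to 30) are those given above.

```python
# IUPAC degenerate base codes used below: N = A/C/G/T, K = G/T.
DEGENERACY = {"A": 1, "C": 1, "G": 1, "T": 1, "N": 4, "K": 2}

MIN_FLANK = 6               # minimum flank length stated in the text
PREFERRED_FLANK = (12, 30)  # preferred flank length range stated in the text


def build_mutagenic_oligo(five_prime_flank, three_prime_flank, n_codons, codon="NNK"):
    """Join a 5' flank, n_codons randomized codons and a 3' flank (written 5'->3')."""
    for flank in (five_prime_flank, three_prime_flank):
        if len(flank) < MIN_FLANK:
            raise ValueError("each flank must be at least %d nucleotides" % MIN_FLANK)
    if n_codons < 1:
        raise ValueError("n_codons must be at least 1")
    return five_prime_flank.upper() + codon * n_codons + three_prime_flank.upper()


def flank_is_preferred(flank):
    """True if the flank length lies in the preferred 12 to 30 nucleotide range."""
    low, high = PREFERRED_FLANK
    return low <= len(flank) <= high


def library_size(oligo):
    """Number of distinct nucleotide sequences encoded by a degenerate oligo."""
    size = 1
    for base in oligo.upper():
        size *= DEGENERACY[base]
    return size
```

With two 18-nucleotide flanks and three NNK codons, the oligonucleotide is 45 nucleotides long and `library_size` returns 32^3 = 32,768 distinct sequences, all of which can be present in a single PCR reaction.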

Thus, the invention contemplates a method for 
increasing the diversity of a library of lambdoid 
phage particles comprising the steps of: a) providing 
a library of lambdoid phage particles according to the 
present invention, and b) mutating the ligand- or 
substrate-binding coding nucleotide sequence present
in each DNA expression vector in the library to form a
library of phage particles each containing a mutated
nucleotide sequence.

The method of mutation can include manipulating 
the genomes of the phage particles in the library in 
order to isolate the nucleic acids in preparation for 
a mutagenizing PCR reaction. Manipulation of a phage
library to isolate the phage genome for use in a PCR
reaction is well known in the art.

In one embodiment, the method of mutation
comprises subjecting the ligand- or substrate-binding 
coding nucleotide sequence to an error-prone 
polymerase chain reaction. In another embodiment, the 
method of mutation comprises subjecting the ligand- or 
substrate-binding coding nucleotide sequence to a 
method for mutating a region (domain) of the ligand- 
or substrate-binding coding nucleotide sequence using 
a region-directed oligonucleotide as described herein. 

J. Method for Screening Proteins on the Surface
of Lambda Phage Particles

Because the biologically active multimeric 
complex is linked to the phage in a surface accessible 
manner, the phage expressing the desired biologically 
active multimeric complex on its surface can be 
advantageously segregated from a population of phage 
by the use of a solid-phase substrate or ligand
molecule. In preferred embodiments, the substrate or
ligand molecule is linked to a solid (aqueous
insoluble) matrix such as agarose, cellulose,
synthetic resins, polysaccharides and the like. For
example, phage with the biologically active multimeric
complex in a surface accessible manner can be
segregated using immunoaffinity techniques such as
panning of phage particles against a preselected
substrate or ligand. In the panning method, substrate
or ligand is applied to and retained in a microtiter
dish and maintained under conditions that support
retention of the substrate or ligand. An aqueous
composition containing a population of phage, each
expressing a biologically active multimeric complex in
a surface accessible manner, is then contacted with
the substrate or ligand under conditions that favor
formation of a solid-phase substrate-enzyme or ligand-
receptor complex. The dish is then washed to remove
unbound phage, leaving the phage bound to the solid-
phase substrate or ligand. The phage can then be
removed and recovered by washing the dish with a
buffer that promotes dissociation of the substrate-
enzyme or ligand-receptor complex. Alternatively, the
phage can be removed by digestion with a protease.
Phage whose genome encodes the desired biologically
active multimeric complex can thus be segregated from
a population of phage.

The method of panning can be repeated by: 1)
amplification of the removed phage particles by
infection of a suitable host, 2) preparation of phage
particles with biologically active multimeric
complexes in a surface accessible manner, 3)
immunoadsorption to a preselected substrate or ligand,
and 4) removal of said immunoadsorbed phage particles.
Thus, the method of panning to segregate the desired
phage can be repeated multiple times to further enrich
the population for phage particles with the desired
biologically active multimeric complex.
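
Why repeated panning enriches the population can be seen from a simple model, offered here only as a sketch. It assumes that each round recovers a fixed fraction of the desired phage and a much smaller fixed fraction of all other phage, and that amplification between rounds does not change their ratio. The function names and the default recovery values are hypothetical and are not measurements from this specification.

```python
def pan_round(binder_fraction, binder_recovery, background_recovery):
    """Fraction of desired phage in the eluate after one round of panning."""
    for value in (binder_recovery, background_recovery):
        if not 0.0 < value <= 1.0:
            raise ValueError("recoveries must be in the interval (0, 1]")
    if not 0.0 <= binder_fraction <= 1.0:
        raise ValueError("binder_fraction must lie between 0 and 1")
    bound_binders = binder_fraction * binder_recovery
    bound_background = (1.0 - binder_fraction) * background_recovery
    return bound_binders / (bound_binders + bound_background)


def enrich(binder_fraction, rounds, binder_recovery=0.1, background_recovery=1e-4):
    """Fraction of desired phage before panning and after each successive round."""
    history = [binder_fraction]
    for _ in range(rounds):
        binder_fraction = pan_round(binder_fraction, binder_recovery, background_recovery)
        history.append(binder_fraction)
    return history
```

Under these assumed recoveries, a clone present at one part per million becomes the majority of the population within three rounds. When the two recoveries are equal there is no enrichment at all, which is why the wash and elution conditions matter.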

Alternatively, a population of purified phage 
with a biologically active multimeric complex in a 
surface accessible manner can be admixed with an
aqueous solution containing a preselected substrate or
ligand. The substrate/enzyme or receptor/ligand
binding reaction admixture thus formed is maintained
for a time period and under binding conditions
sufficient for a phage-linked enzyme-substrate or
receptor-ligand complex to form. The phage-bound
substrate (substrate-bearing phage) or ligand (ligand-
bearing phage) is then separated and recovered from
the unbound materials, such as by centrifugation,
electrophoresis, precipitation, and the like.

A substrate or ligand analogue may be substituted 
for the substrate or ligand in the method of 
segregating a phage with the desired biologically 
active multimeric complex. 

Phage expressing the desired biologically active 
multimeric complex on its surface can be 
advantageously segregated from a population of phage 
by the use of a solid-phase antibody molecule. In 
preferred embodiments, the antibody molecule is 
linked to a solid (aqueous insoluble) matrix such as
agarose, cellulose, synthetic resins, polysaccharides
and the like. For example, phage with the
biologically active multimeric complex in a surface
accessible manner can be segregated using
immunoaffinity techniques such as panning of phage
particles against a preselected antibody
immunoreactive with an epitope on the multimeric
complex. In the panning method, antibody is applied
to and retained in a microtiter dish and maintained
under conditions that support retention of the
antibody. An aqueous composition containing a
population of phage, each expressing a biologically
active multimeric complex in a surface accessible
manner, is then contacted with the antibody under
conditions that favor formation of a solid-phase
multimeric complex-antibody complex. The dish is then
washed to remove unbound phage, leaving the phage
bound to the solid-phase antibody. The phage can then
be removed and recovered by washing the dish with a
buffer that promotes dissociation of the multimeric
complex-antibody complex. Alternatively, the phage
can be removed by digestion with a protease. Phage
whose genome encodes the desired biologically active
multimeric complex can thus be segregated from a
population of phage. This process can be repeated to
enrich the phage population for the desired activity
as described herein for the panning method.

Phage expressing a desired catalytically active
multimeric complex can be identified in a population
of phage by catalysis of a preselected substrate. In
preferred embodiments, the substrate molecule is
contacted with the phage particle and maintained under
conditions that support catalysis of the substrate to
form a detectable product or products. Phage whose
genome encodes a desired catalytically active
multimeric complex can thus be identified in a
population of phage particles.

K. Diagnostic Methods and Systems

The present invention contemplates various 
assay methods that would utilize the biological 
activity of the fusion protein of the present 
invention, for example to detect the presence, and
preferably amount, of a preselected receptor, ligand,
enzyme or substrate (i.e., target material) with which
the fusion protein binds or reacts. The assay can be
conducted on a sample such as a biological fluid or
tissue sample using a fusion protein or phage
displaying a fusion protein of this invention as a
reagent to form a product whose amount relates,
either directly or indirectly, to the amount of
material (target material) to be detected or measured
in the sample.

Those skilled in the art will understand that 
there are numerous well known clinical diagnostic 
chemistry procedures in which a reagent of this 
invention can be used to form a reaction product
whose amount relates to the amount of target material 
present in a body sample. Thus, while exemplary assay 
methods are described herein, the invention is not so 
limited. 

Various heterogeneous and homogeneous protocols,
either competitive or noncompetitive, can be employed
in performing an assay method of this invention.
Examples of types of assays which can utilize a fusion
protein of the invention include competitive and
non-competitive binding assays in either a direct or
indirect format. Examples of such assays are the
ELISA assays, binding assays, radioimmunoassay (RIA),
the sandwich (immunometric) assay, and the like.
Detection of a target material using the fusion 
protein of the invention can be done utilizing assays 
which are run in either the forward, reverse, or 
simultaneous modes, including histochemical assays on 
physiological samples. Those of skill in the art will 
know, or can readily discern, other assay formats 
without undue experimentation. 

In addition, solid or liquid phase enzyme assays
are contemplated in which the fusion protein has
catalytic activity that is useful for detecting the
presence of substrate by generation of product.

The fusion protein of the invention can be bound 
to many different carriers and used to detect the 
presence of a target material. Examples of well-known 
carriers include glass, polystyrene, polypropylene, 
polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, agaroses and
magnetite. The nature of the carrier can be either 
soluble or insoluble for purposes of the invention. 
Those skilled in the art will know of other suitable 
carriers for binding fusion protein, or will be able 
to ascertain such, using routine experimentation. 

There are many different labels and methods of 
labeling known to those of ordinary skill in the art. 
Examples of the types of labels which can be used in 
the present invention include enzymes, radioisotopes, 
fluorescent compounds, colloidal metals, 
chemiluminescent compounds, and bio-luminescent 
compounds. Those of ordinary skill in the art will 
know of other suitable labels for binding to the 
monoclonal antibodies of the invention, or will be 
able to ascertain such, using routine experimentation. 
Furthermore, the binding of these labels to the fusion 
protein or phage particle of the invention can be done 
using standard techniques common to those of ordinary 
skill in the art. 

For purposes of the invention, a target material 
may be detected by the fusion protein of the invention 
when present in samples of biological fluids and 
tissues. Any sample containing a detectable amount of 
target material can be used. A sample can be a liquid 
such as a natural or man-made source, industrial or 
biological waste or by-product, urine, saliva,
cerebrospinal fluid, blood, serum and the like, or a
solid or semi-solid such as tissues, feces, and the
like, or, alternatively, a solid tissue such as those 
commonly used in histological diagnosis. 

Another labeling technique which may result in 
greater sensitivity consists of coupling the fusion
protein to low molecular weight haptens. These
haptens can then be specifically detected by means of
a second reaction. For example, it is common to use
haptens such as biotin, which reacts with avidin, or
dinitrophenol, pyridoxal, or fluorescein, which can
react with specific anti-hapten antibodies.

A further diagnostic method utilizes the 
multivalency of a lambdoid phage particle to cross- 
link ligand, thereby forming an aggregation of 

multiple ligands and phage particles, producing a
precipitable aggregate. This embodiment is comparable
to the well known methods of immune precipitation.
This embodiment comprises the steps of admixing a
sample with a plurality of phage particles of this
invention to form a binding admixture under binding
conditions, followed by a separation step to isolate
the formed binding complexes. Typically, isolation is
accomplished by centrifugation or filtration to remove
the aggregate from the admixture. The presence of
binding complexes indicates the presence of the
preselected ligand to be detected.

Thus, in a preferred embodiment, a method for 
detecting the presence of a preselected target in a 
sample comprises the steps of: 

a) admixing a sample containing a
preselected target with a recombinant lambdoid
bacteriophage of this invention, wherein the
preselected polypeptide defines a biologically active
ligand or receptor able to bind the preselected
target, under binding conditions sufficient for the
target-binding bacteriophage to bind the target and
form a target-ligand or receptor complex;

b) detecting the presence of the complex,
and thereby the presence of the preselected target.

The binding conditions can vary widely depending 
upon the type of binding activity of the fusion 
protein, which in turn depends on the type of binding 
ligand, as is well appreciated in the art. Because 
the invention is not directed to the particular type 
of biological activity being displayed, it is to be 
appreciated that the binding conditions are not to be 
so limited based on a particular ligand. 

The detection of the complex or bound target can 
be accomplished by a variety of means as discussed 
earlier. However, in one preferred embodiment the 
detecting step can comprise detecting the presence of 
the bacteriophage particles, such as with an antibody 
specific for a phage antigen, and thereby detect the 
presence of the preselected target. 

The present invention also describes a diagnostic 
system, preferably in kit form, for assaying for the 
presence of a preselected substrate in a sample where 
it is desirable to detect the presence, and preferably 
the amount, of the substrate in a sample according to 
the diagnostic methods described herein. 
Alternatively, the present invention also describes a 
diagnostic system, preferably in kit form, for 
assaying for the presence of a preselected ligand, or 
antigen, in a sample where it is desirable to detect 
the presence, and preferably the amount, of the ligand 
or antigen in a sample according to the diagnostic 
methods described herein. 

In another embodiment, a diagnostic system is 
contemplated for assaying for the presence of an
enzyme or receptor in a body fluid sample such as for
monitoring the fate of therapeutically administered
enzyme or receptor. The system includes, in an amount
sufficient for at least one assay, an enzyme or
receptor as a control reagent, and preferably a
preselected amount of substrate or ligand, each as
separately packaged immunochemical reagents.

The sample can be a tissue, tissue extract, fluid 
sample or body fluid sample, such as blood, plasma or
serum.

The diagnostic system includes, in an amount 
sufficient to perform at least one assay, a lambdoid 
phage or substrate- or ligand-binding multimeric 
receptor according to the present invention, as a
separately packaged reagent.

Exemplary diagnostic systems for detecting a 
preselected substrate or ligand in the solid phase and 
utilizing a lambdoid phage of this invention are 
described in the Examples. 

In embodiments for detecting substrate or ligand
in a body fluid, a diagnostic system of the present
invention can include a label or indicating means
capable of signaling the formation of a substrate- or
ligand-binding complex containing a bacteriophage
displaying an enzyme or ligand of the present
invention.

Phage of this invention can be labeled when used 
in a diagnostic method of this invention. Preferred 
labels include radioactively labeled nucleic acids 

incorporated into the phage genome, or radioactively
labeled amino acids incorporated into protein
components of the phage particle. Labeled phage can
be routinely prepared by growing phage as described
herein, but including radiolabeled nucleotides or
radiolabeled amino acids in the culture
medium for incorporation into nucleic acids or
polypeptides of the phage, respectively. Exemplary
labels are 3H-thymidine or 35S-methionine. Other
isotopic labels and other nucleotide or amino acid
precursors are readily available to one skilled in the
art. The labeled phage preferably contains sufficient
label to be detectable in a ligand-binding assay of
this invention, i.e., the phage is detectably labeled.

As used herein, the term "package" refers to a
solid matrix or material such as glass, plastic (e.g.,
polyethylene, polypropylene or polycarbonate), paper,
foil and the like capable of holding within fixed
limits a multimeric receptor, lambdoid phage or
library of phage of the present invention. Thus, for
example, a package can be a glass vial used to contain
milligram quantities of a labeled phage preparation,
or it can be a microtiter plate well to which
microgram quantities of a contemplated receptor,
ligand, antibody or phage particle(s) have been
operatively affixed, i.e., linked so as to be capable
of binding a substrate or ligand.

A diagnostic system of the present invention 
preferably also includes a label or indicating means 
capable of signaling the formation of a binding 

reaction complex containing a substrate- or ligand-
binding multimeric receptor or phage complexed with
the preselected substrate or ligand. 

The word "complex" as used herein refers to the 
product of a specific binding reaction such as a
phage-substrate or receptor-substrate reaction.
Alternatively, the word "complex" as used herein
refers to the product of a specific binding reaction
such as a phage-ligand or receptor-ligand reaction.
Exemplary complexes are substrate-enzyme or receptor-
ligand products.




As used herein, the terms "label" and "indicating 
means" in their various grammatical forms refer to
single atoms and molecules that are either directly or
indirectly involved in the production of a detectable
signal to indicate the presence of a complex. Any
label or indicating means can be linked to or
incorporated in an expressed polypeptide, or phage
particle that is used in a diagnostic method. Such
labels are themselves well-known in clinical
diagnostic chemistry and constitute a part of this
invention only insofar as they are utilized with
otherwise novel proteins, methods and/or systems.

The labeling means can be a fluorescent labeling 
agent that chemically binds to antibodies or antigens 
without denaturing them to form a fluorochrome (dye)
that is a useful immunofluorescent tracer. Suitable
fluorescent labeling agents are fluorochromes such as
fluorescein isocyanate (FIC), fluorescein
isothiocyanate (FITC), 5-dimethylamine-1-
naphthalenesulfonyl chloride (DANSC),
tetramethylrhodamine isothiocyanate (TRITC),
lissamine, rhodamine 8200 sulphonyl chloride (RB 200
SC) and the like. A description of immunofluorescence
analysis techniques is found in DeLuca,
"Immunofluorescence Analysis", in Antibody As a Tool,
Marchalonis, et al., eds., John Wiley & Sons, Ltd.,
pp. 189-231, 1982, which is incorporated herein by
reference.

In preferred embodiments, the indicating group is 

an enzyme, such as horseradish peroxidase (HRP),
glucose oxidase, or the like. In such cases where the
principal indicating group is an enzyme such as HRP or
glucose oxidase, additional reagents are required to
visualize the fact that a substrate-enzyme or
receptor-ligand complex has formed. Such additional
reagents for HRP include hydrogen peroxide and an
oxidation dye precursor such as diaminobenzidine. An
additional reagent useful with glucose oxidase is
2,2'-azino-di-(3-ethyl-benzthiazoline-6-sulfonic acid)
(ABTS).

Radioactive elements are also useful labeling 
agents and are used illustratively herein. An 
exemplary radiolabeling agent is a radioactive element 
that produces gamma ray emissions. Elements which 
themselves emit gamma rays, such as 124I, 125I, 128I, 132I
and 51Cr represent one class of gamma ray emission-
producing radioactive element indicating groups.
Particularly preferred is 125I. Another group of
useful labeling means are those elements such as 11C,
18F, 15O and 13N which themselves emit positrons. The
positrons so emitted produce gamma rays upon
encounters with electrons present in the animal's
body. Also useful is a beta emitter, such as 111indium
or 3H.

The linking of labels, i.e., labeling of, 
polypeptides and proteins is well known in the art. 
For instance, proteins or phage can be labeled by 
metabolic incorporation of radioisotope-containing 
amino acids provided as a component in the culture 
medium. See, for example, Galfre et al., Meth.
Enzymol., 73:3-46, 1981. The techniques of protein
conjugation or coupling through activated functional
groups are particularly applicable. See, for example,
Aurameas, et al., Scand. J. Immunol., Vol. 8 Suppl.
7:7-23, 1978, Rodwell et al., Biotech., 3:889-894,
1984, and U.S. Pat. No. 4,493,795. 

The diagnostic systems can also include, 
preferably as a separate package, a specific binding 
agent. A "specific binding agent" is a molecular 
entity capable of selectively binding a reagent
species of the present invention or a complex
containing such a species, but is not itself a
polypeptide or phage of the present invention.
Exemplary specific binding agents are antibody
molecules, complement proteins or fragments thereof,
S. aureus protein A, and the like. Preferably the
specific binding agent binds the reagent species when 
that species is present as part of a complex. 

In preferred embodiments, the specific binding 
agent is labeled. However, when the diagnostic system 
includes a specific binding agent that is not labeled, 
the agent is typically used as an amplifying means or 
reagent. In these embodiments, the labeled specific 
binding agent is capable of specifically binding the 
amplifying means when the amplifying means is bound to 
a reagent species-containing complex. 

The diagnostic kits of the present invention can
be used in an "ELISA" format to detect the quantity of
a preselected substrate or ligand in a fluid sample.
"ELISA" refers to an enzyme-linked immunosorbent assay
that employs an antibody or antigen bound to a solid
phase and an enzyme-antigen or enzyme-antibody
conjugate to detect and quantify the amount of an
antigen present in a sample and is readily applicable
to the present methods. A description of the ELISA
technique is found in Chapter 22 of the 4th Edition of
Basic and Clinical Immunology by D.P. Stites et al.,
published by Lange Medical Publications of Los Altos,
CA in 1982 and in U.S. Patents No. 3,654,090; No.
3,850,752; and No. 4,016,043, which are all
incorporated herein by reference.

Thus, in some embodiments, a phage or fusion
polypeptide of the present invention can be affixed to
a solid matrix to form a solid support that comprises
a package in the subject diagnostic systems.

A reagent is typically affixed to a solid matrix
by adsorption from an aqueous medium, although other
modes of affixation applicable to proteins and
polypeptides that are well known to those skilled in
the art can be used. Exemplary adsorption methods are
described herein.

Useful solid matrices are also well known in the
art. Such materials are water insoluble and include
the cross-linked dextran available under the trademark
SEPHADEX from Pharmacia Fine Chemicals (Piscataway,
NJ); agarose; polystyrene beads about 1
micron to about 5 millimeters in diameter available
from Abbott Laboratories of North Chicago, IL;
polyvinyl chloride, polystyrene, cross-linked
polyacrylamide, nitrocellulose- or nylon-based webs
such as sheets, strips or paddles; or tubes, plates or
the wells of a microtiter plate such as those made
from polystyrene or polyvinylchloride.

The reagent species, labeled specific binding
agent or amplifying reagent of any diagnostic system
described herein can be provided in solution, as a
liquid dispersion or as a substantially dry powder,
e.g., in lyophilized form. Where the indicating means
is an enzyme, the enzyme's substrate can also be
provided in a separate package of a system. A solid
support such as the before-described microtiter plate
and one or more buffers can also be included as
separately packaged elements in this diagnostic assay
system.

The packaging materials discussed herein in
relation to diagnostic systems are those customarily
utilized in diagnostic systems.

Instructions for the use of the packaged
reagent(s) are also typically included. "Instructions
for use" typically include a tangible expression
describing the reagent concentration or at least one
assay method parameter such as the relative amounts of
reagent and sample to be admixed, maintenance time
periods for reagent/sample admixtures, temperature,
buffer conditions and the like.

The materials for use in the assay of the
invention are ideally suited for the preparation of a
kit. Such a kit may comprise a carrier means being
compartmentalized to receive in close confinement one
or more container means such as vials, tubes, and the
like, each of the container means comprising one of
the separate elements to be used in the method. For
example, one of the container means may comprise an
enzyme or receptor of the invention which is, or can
be, detectably labelled. The kit may also have
containers containing any of the other above-recited
immunochemical reagents used to practice the
diagnostic methods.

Examples 

The following examples are intended to
illustrate, but not limit, the scope of the invention.

1. Construction of a Tricistronic Expression Vector
λfoo for Displaying a Protein on the Surface of
Lambda Phage Particles

Expression libraries in bacteriophage M13 have
previously been constructed for generating a large
number of Fab antibody fragments displayed on the
surface of filamentous phage. The displayed Fabs can
be screened directly for a desired binding activity.
These systems are primarily useful in the surface
display of secreted proteins and do not contain design
features that provide for the surface display of
nonsecreted or cytoplasmic proteins. While these
systems provide for the expression of multimeric
proteins, two separate cloning steps are required. In
addition, these systems do not provide for the
expression of multimeric proteins consisting of both
the surface display fusion protein and the unfused
wild-type protein monomer or monomers. Such
expression may be required for the proper assembly of
multimeric proteins. The assembly of the displayed
protein into multimers may be required for the
recognition of the multimeric complex by immunological
reagents such as antibodies and for biological
activities such as catalytic, substrate- or ligand-
binding activities.

The main criterion used in choosing a vector 
system was the necessity of generating a system which 
would be more appropriate for proteins that fold in 
the cytoplasm. Bacteriophage lambda was selected as 
the expression vector for several reasons. First, in
vitro packaging of phage DNA was the most efficient 
method of reintroducing DNA into host cells. Second, 
it was possible to detect protein expression at the 
level of a single plaque. Finally, bacteriophage 
lambda is assembled in the cytoplasm from proteins 
which are produced in the cytoplasm. 

A lambda phage expression system, λfoo, which
produces biologically active proteins displayed on the
surface of the phage particle was constructed. The
biologically active proteins described in this
invention are multimeric proteins. The biologically
active protein consists of a fusion protein, in which
a truncated pV and a monomer of the displayed protein
are linked during translation by a peptide bond, and
one or more free monomers. The fusion
protein and one or more free monomers assemble to form
a multimeric complex.

The fusion protein and monomers are encoded by
genes inserted into the λ vector and are expressed by
conditional suppression. Conditional suppression
allows the expression of the truncated pV' protein,
fusion protein, and monomeric proteins from a single
mRNA transcript. The expression of three separate
proteins from a single mRNA transcript in the λfoo
expression vector is termed tricistronic expression
and is described further herein. In addition,
conditional suppression expresses the fusion protein
and monomers in a ratio relative to the other λ
proteins which favors the assembly of functional phage
particles.

One of the biologically active proteins of this
invention comprises a truncated pV and β-gal fusion
protein (pV-β-gal fusion protein) and β-gal monomers
which are also assembled to form a tetrameric complex.
These proteins are encoded by genes inserted into the
λβ-gal vector and are expressed by conditional
suppression. A tetrameric complex comprising the
pV-β-gal fusion protein and the individual β-gal monomers
must be formed for biological activity. Three
monomers of β-gal assemble with the pV-β-gal fusion
protein to form the biologically active β-gal protein
on the surface of the phage particles. The biological
activity of β-gal refers to the ability of β-gal to
cleave a galactoside sugar to form two sugars.

Similarly, a truncated pV and BPA fusion protein
(pV-BPA fusion protein) and BPA monomers are also
assembled to form a tetrameric complex. These
proteins are encoded by genes inserted into the λBPA
vector and are expressed by conditional suppression.
A tetrameric complex comprising the pV-BPA fusion
protein and the individual BPA monomers must be formed
for biological activity. Three monomers of BPA
assemble with the pV-BPA fusion protein to form the
biologically active BPA protein on the surface of the
phage particles. The biological activity of BPA
refers to the ability of BPA to bind mucin.

One of skill in the art would realize that the 
truncated pV protein may be fused with a biologically 
active protein, a monomer which assembles with 
identical monomers to form a homomeric biologically 
active protein, or a monomer which assembles with 
nonidentical monomers to form a heteromeric 
biologically active protein. 

A series of λ phage vectors with pV', a truncated
form of pV, were constructed and are described herein.

a. Construction of the Expression Vector λV'sac

The first vector, λV'sac, was constructed
with a truncated pV protein to determine if functional
phage particles would be assembled with the truncated
pV protein. Wild-type pV is a 25.8 kDa protein and
contains 246 amino acid residues. λV'sac has an amber
stop codon, TAG, in place of GAG coding for Glu at
amino acid residue 177 of the pV gene and produces a
truncated pV protein which is 18.8 kDa with only 176
amino acid residues when the pV protein is expressed
in a nonsuppressor strain. The truncated pV,
designated pV', is therefore 70 amino acids shorter
than the wild-type pV. pV' is expressed in all of the
λ vectors described in this invention. λV'sac also
has a unique SacI restriction site, two base pairs
downstream of the amber codon, which was used as an
insertion site in the construction of additional λ
vectors (Figure 1).

1) Preparation of λV'sac

λV'sac was generated from a combination
of the left arm of λ1685 [blOOI Δ(att-int) assIIXI-2
KH54 srIλ4° nin5 shndIIIλ6°] and the right arm of λ2000
(a derivative of λ2001 lacking the polylinker sites;
Karn et al., Gene, 32:217, 1984; λ2001 is available
from the American Type Culture Collection, ATCC No.
37474). λV'sac contains an amber mutation and a
unique SacI restriction site in the V gene of λ phage.

To generate λV'sac, λ1685 was digested with BamHI
and NsiI to release a 4.82 kb BamHI-NsiI segment of
DNA which contained genes Nu3 through G (Figure 1).
The 4.82 kb BamHI-NsiI segment of DNA was inserted
into pUM13 (Maruyama et al., Gene, 120:135-141, 1992)
which was also digested with BamHI and NsiI, using T4
DNA ligase and standard protocols such as those
described in Current Protocols in Molecular Biology,
Ausubel et al., eds., John Wiley and Sons, NY, 1987.
The pUM13 plasmid is a derivative of pBluescribe M13+
(Stratagene, La Jolla, California) and was made by
inserting an oligonucleotide which contained
additional XhoI and NsiI restriction sites in frame
into the polylinker sequence. A 0.96 kb BclI-EcoRI
fragment from the pUM13 plasmid containing the segment
of λ1685 was inserted into the BamHI and EcoRI site of
the M13mp19 phage vector (Pharmacia, Piscataway, New
Jersey). Site-directed mutagenesis was performed as
described in Zoller et al., DNA, 3:479, 1984 using the
single-stranded form of phage DNA as template and the
oligonucleotide 5'-AGTGTGGAGCTCTACCCTTTC-3' (SEQ ID NO
2). The site-directed mutagenesis introduced an amber
codon and a SacI site following nucleotide 9482 at
amino acid 176 of the V gene. The segment containing
the amber codon and SacI site was excised by digesting
the M13mp19 construct with AatII and EcoRI. This
fragment and a 4.9 kb BamHI-AatII fragment of the
pUM13 clone were ligated at the common AatII site and
then into the pUC12 vector (Karn et al., Gene,
32:217, 1984) which was digested with BamHI and EcoRI.
A 4.5 kb fragment was excised by digestion with ApaLI
and ApaI from this construct and ligated with a 5.6 kb
left arm of λ1685 digested with ApaLI and a 33 kb
right arm of λ2000 digested with ApaI. This final
ligation produced the λV'sac vector which contains an
amber mutation and a SacI restriction site in the V
gene, the left arm of λ1685, and the right arm of
λ2000.
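
As a quick consistency check on the mutagenic oligonucleotide quoted above (SEQ ID NO 2), the following minimal Python sketch confirms that the printed sequence contains a SacI recognition site (GAGCTC) and that its reverse complement contains the amber triplet TAG. The helper name reverse_complement and the variable names are illustrative only, and the sketch makes no claim about strand orientation or reading frame, which the text does not state.

```python
# Sequence exactly as printed in the text (SEQ ID NO 2).
OLIGO = "AGTGTGGAGCTCTACCCTTTC"
SACI_SITE = "GAGCTC"  # SacI recognition sequence (palindromic)
AMBER = "TAG"         # amber stop triplet

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


opposite_strand = reverse_complement(OLIGO)

# Both checks are simple substring searches; they ignore reading frame.
has_saci_site = SACI_SITE in OLIGO
has_amber_on_opposite_strand = AMBER in opposite_strand
```

Both flags evaluate to true for the printed sequence, which is consistent with the statement that the mutagenesis introduced an amber codon together with a SacI site.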

2) Expression of pV' and Wild-Type pV
Proteins from λV'sac by Conditional
Suppression

Conditional suppression was used to
characterize the expression of pV' and wild-type pV
encoded by λV'sac with several E. coli strains. This
characterization was performed to verify that
functional phage particles could be produced with
only the truncated pV protein. Conditional
suppression was used to vary the amount of pV' and
wild-type pV expressed by varying the amber suppressor
E. coli strain used to express the pV proteins. The
E. coli strains were Q1 (thr, leu, lac, supE), EQ170
[Δ(lac-proAB), nalA, rif, argE am, metB, supE], EQ166
[Δ(lac-proAB), nalA, rif, argE am, ara], CA168 (Hfr C,
lacIQ2, gal, supB), MC8 (lacI25, supG, trp am), TG1
[Δ(lac-proAB), supE, thi, hsdD5/F', traD36, proAB+,
lacIq ZΔM15], EQ82 (hsdR, supE, supF, met), and Q358
(hsdR, supE, φ80R). EQ166 does not produce a
suppressor tRNA which can suppress an amber mutation
and is therefore designated as an su- strain. The
remaining hosts contain a suppressor tRNA which can
suppress the amber mutation and are therefore
designated su+.

The su+ strains tested in this invention produce
varying amounts of suppressor tRNA. The frequency
with which the suppressor tRNA inserts an amino acid
residue at the amber codon is dependent upon the
amount of suppressor tRNA present in the E. coli
strain. Therefore, the frequency with which the su+
strains tested in this invention would insert an amino
acid residue at the amber codon would vary and thus
the ratio of truncated pV to wild-type pV produced
would vary.
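
The dependence described above can be written as simple bookkeeping: if a fraction f of ribosomes reaching the amber codon insert an amino acid, full-length and truncated pV are made in the proportions f and 1 - f. The Python sketch below only illustrates that relation; the function name and the example value of f are hypothetical, and the text gives no measured suppression efficiencies.

```python
def pv_product_fractions(readthrough: float) -> dict[str, float]:
    """Split pV translation products by amber read-through frequency.

    `readthrough` is the assumed fraction (0.0 to 1.0) of translation
    events in which a suppressor tRNA inserts a residue at the amber codon.
    """
    if not 0.0 <= readthrough <= 1.0:
        raise ValueError("readthrough must lie between 0 and 1")
    return {
        "truncated": 1.0 - readthrough,  # terminated at the amber codon
        "full_length": readthrough,      # amber codon suppressed
    }


# A nonsuppressor host corresponds to readthrough = 0.0;
# the value 0.25 is purely illustrative.
no_suppressor = pv_product_fractions(0.0)
weak_suppressor = pv_product_fractions(0.25)
```

Under this simplification a nonsuppressor host yields only truncated pV, while a stronger suppressor shifts the balance toward the full-length product.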

Growth of λV'sac in several E. coli strains was
characterized to verify that functional phage
particles would be formed when only the truncated pV
protein was incorporated into the tail portion of the
phage particle. The phage particles produced in the
E. coli strains with a suppressor tRNA contain
full-length pV, produced by suppression and read-through
of the amber mutation by the host, in addition to
the truncated pV. The full-length or wild-type pV
is produced by the insertion of an amino acid at the
amber codon by the suppressor tRNA. The phage particles
produced in the E. coli strains without a suppressor
tRNA contain only the truncated pV due to lack of
suppression of the amber mutation by the host. The
formation of plaques in the E. coli host EQ166 (su-)
would therefore confirm previous findings that a pV
molecule that is truncated at the carboxy terminus can
participate in functional phage particle assembly.
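
The difference between the two kinds of host can be illustrated with a toy translation routine. This is a minimal sketch under stated assumptions: the short sequence is invented for illustration and is not the pV gene, the codon table covers only the codons used, suppression is treated as all-or-nothing, and glutamine is used as the inserted residue only as an example (a supE-type suppressor). The residue counts are those stated earlier for pV and pV'.

```python
# Figures stated in the text for wild-type pV and the truncated pV'.
WILD_TYPE_LENGTH = 246                # residues in wild-type pV
AMBER_RESIDUE = 177                   # position of the amber codon
TRUNCATED_LENGTH = AMBER_RESIDUE - 1  # residues in pV'

# Minimal codon table covering only the toy sequence below.
CODON_TABLE = {
    "ATG": "M", "AAA": "K", "GGT": "G", "GAG": "E",
    "TAG": "*",  # amber stop codon
    "TAA": "*",  # ochre stop codon
}


def translate(dna: str, amber_suppressed: bool = False,
              suppressor_residue: str = "Q") -> str:
    """Translate `dna` codon by codon, optionally reading through TAG."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if codon == "TAG" and amber_suppressed:
            residue = suppressor_residue  # suppressor tRNA inserts a residue
        if residue == "*":
            break  # termination at an unsuppressed stop codon
        protein.append(residue)
    return "".join(protein)


# Hypothetical toy gene: Met-Lys-amber-Gly-Glu-stop.
TOY_GENE = "ATGAAATAGGGTGAGTAA"
truncated = translate(TOY_GENE)                          # su- host
full_length = translate(TOY_GENE, amber_suppressed=True)  # su+ host
```

In the nonsuppressor case translation stops at the amber codon, mirroring the 176-residue pV'; with suppression the downstream codons are also translated.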

Growth in the E. coli strains was determined by
the formation of plaques on a lawn of E. coli.
Briefly, to form plaques on a lawn of E. coli, phage
particles are mixed with the E. coli cells and
incubated at 37°C for 15 minutes to allow the phage to
adsorb to the lamB receptor on the surface of the
cells and inject the λ DNA into the cells. Cells and
phage are then mixed with 0.7% (w/v) agar in a
suitable medium which has been melted and equilibrated
to 48°C. After mixing, the mixture is poured onto the
surface of 2% (w/v) agar in a suitable medium, allowed
to solidify, and incubated at 37°C until the E. coli
has grown to form a confluent layer, or lawn, on the
surface of the agar. Clear areas on
the lawn indicate where a functional phage particle
has infected the E. coli cells and caused lysis of the
E. coli cells. Thus, a plaque represents a functional
phage particle. Because pV protein is required for
the assembly of a functional phage particle, a plaque
also indicates that pV protein capable of assembly to
form a functional phage particle was expressed from
the λ vector which encodes the pV protein. The
morphology of the plaque, such as size and clarity, is
an indication of the number of functional phage
particles produced by the E. coli strain.

λV'sac produced plaques which were similar in
size to those of one of the parental vectors, λ2001, on
many of the E. coli strains tested. This indicates that the
truncated pV expressed from λV'sac produced functional
phage particles as well as λ2001 with wild-type pV in
the nonsuppressor E. coli strains tested.

b. Construction of Expression Vector λV'mcs

λV'mcs was constructed from λV'sac by
inserting double-stranded synthetic oligonucleotides
which contain multiple restriction sites at the unique
SacI site of λV'sac to form a multiple cloning site
(mcs). The restriction sites inserted were SacI,
HindIII, BamHI, SmaI, SalI, and EcoRI.

1) Expression of pV Proteins from λV'mcs
by Conditional Suppression

Growth of the λV'mcs was characterized
with several E. coli hosts. λV'mcs grows well on su-
hosts such as EQ166 but produces a slightly smaller
plaque on su+ hosts such as Q358. λV'mcs produces a
truncated pV' protein on su- hosts and a combination
of pV' and a pV' protein with the amino acid residues
encoded by the mcs inserted at amino acid residue 176
of the pV protein in su+ hosts. Thus, the presence of
the amino acid residues encoded by the mcs in the pV
protein may affect the assembly of infectious phage
particles in su+ hosts.

c. Construction of Expression Vector λblue-α

λblue-α was constructed from λV'sac and
λV'mcs by inserting a unique NotI site and the
multiple cloning site from pUC19 (Pharmacia,
Piscataway, New Jersey) at the unique SacI site. In
addition, the α-peptide sequence of the β-gal gene
with the β-gal ribosome binding site or Pribnow box
and an SfiI site were inserted downstream of the
multiple cloning site. Thus, λblue-α differs from
the previously constructed λ vectors by the presence
of the multiple cloning site from pUC19, a unique NotI
site, and the α-peptide of the β-gal gene.

The α-peptide of the β-gal gene provides
α-complementation of the β-gal gene to produce
functional β-gal in an appropriate E. coli host and is
used in the identification of λ vectors containing a
DNA insert. If a DNA insert is ligated into the mcs
between the β-gal promoter and the α-peptide, the
α-peptide will not be expressed and α-complementation of
the β-gal gene to produce functional β-gal will not
occur. Functional β-gal activity is detected by the
formation of a blue color by cleavage of the β-gal
substrate 5-bromo-4-chloro-3-indolyl-β-D-
galactopyranoside (X-gal). When a DNA insert is not
present, α-complementation occurs, functional β-gal is
produced, and X-gal is cleaved to form a blue color.
When X-gal is included in the medium, phage plaques
formed by a λ vector with an insert in the mcs will
remain colorless and phage plaques formed by a λ vector
without an insert in the mcs will turn blue. Thus,
the presence of the α-peptide in λblue-α provides a
means for the identification of recombinants
containing a DNA insert in the mcs.
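
The color screen described above reduces to a two-input decision, sketched here in Python. The function and parameter names are illustrative, and the colorless result when X-gal is absent is an added assumption (no chromogenic substrate, hence no color) rather than a statement from the text.

```python
def plaque_color(insert_in_mcs: bool, xgal_in_medium: bool = True) -> str:
    """Predict plaque color for a vector carrying the alpha-peptide.

    An insert in the mcs prevents alpha-peptide expression, so no
    alpha-complementation occurs and X-gal is not cleaved.
    """
    if not xgal_in_medium:
        # Assumption: without the chromogenic substrate no color develops.
        return "colorless"
    return "colorless" if insert_in_mcs else "blue"


# Recombinant (with insert) versus parental vector (without insert).
recombinant = plaque_color(insert_in_mcs=True)
parental = plaque_color(insert_in_mcs=False)
```

Colorless plaques on X-gal medium therefore mark candidate recombinants, and blue plaques mark vector without insert.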

The first step in the construction of λblue-α was
to digest pUC19 with HindIII and NarI to generate a
fragment which contains the multiple cloning site of
pUC19 and the α-peptide sequence of the β-gal gene.
This fragment was ligated with an oligonucleotide
linker containing a NotI site (SEQ ID NOs 3 and 4) and
NarI and SacI compatible overhangs on the 5' and 3'
ends, respectively. The two oligonucleotides given in
SEQ ID NOs 3 and 4 were annealed to generate the
oligonucleotide linker. The ligated fragment
consisting of the α-peptide sequence of the β-gal
gene, the multiple cloning site of pUC19, and the
oligonucleotide linker containing a NotI site was
then ligated with the left and right vector arms from
λV'mcs and λV'sac, respectively. Specifically, the
fragment was ligated with the left arm of λV'mcs
digested with HindIII and the right arm from λV'sac
digested with SacI.

1) Expression of pV' and Wild-Type pV
Proteins from λblue-α by Conditional
Suppression

Conditional suppression was used to
characterize growth of λblue-α in several E. coli
strains. The ability of λblue-α without an insert to
express the α-peptide of β-gal and thus provide the
α-complementation needed to produce functional β-gal was
determined by forming plaques on a lawn of E. coli as
described above and with the substrate X-gal in the
medium. Plaques formed by λblue-α turned blue
in the presence of X-gal, indicating that λblue-α did
not contain a DNA insert and expressed the α-peptide.
λblue-α grew well on all E. coli hosts tested,
indicating that the construction of the vector did not
interfere with expression of genes downstream of the V
gene.

d. Construction of Expression Vector λblue

λblue was constructed from λblue-α by
inserting a double-stranded synthetic oligonucleotide
which encodes a unique restriction site, SfiI, a
ribosomal binding site, and an initiator methionine
codon following the amber stop codon in pV and before
the mcs. The insertion of this oligonucleotide
removed the original SacI site adjacent to the amber
stop codon.

1) Expression of pV' and Wild-Type pV
Proteins from λblue by Conditional
Suppression

Conditional suppression was used to
characterize expression of λblue in several
suppressor and nonsuppressor E.
coli strains. λblue produced smaller phage plaques
than λV'sac and λV'mcs when plated with EQ166 (su-)
and Q358 (su+).

e. Construction of Expression Vector λfoo

λfoo was constructed by inserting a double-
stranded synthetic oligonucleotide encoding a linker
peptide consisting of alternating prolyl and threonyl
residues, termed the Pro-Thr sequence of Cellulomonas
fimi endo-β-1,4-glucanase (Wong et al., Gene, 44:315-
324, 1986), at the unique SfiI site of λblue (Figure
2). In the endo-β-1,4-glucanase molecule the Pro-Thr
box links two domains, the cellulose-binding domain
and catalytic domain in C. fimi exoglucanase (Cex) and
endoglucanase (CenA), and is similar in structure to
the hinge region of IgA1 immunoglobulins (Ong et al.,
Biotechnology, 7:604-607, 1989). The Pro-Thr linker
in λfoo serves to separate pV' from the fused foreign
protein and may prevent the two domains from
interfering with each other during folding and
assembly of the phage tail proteins to form the phage
tail matrix. The linker also allows foreign proteins
to be cleaved from the phage particles by digestion
with enzymes such as C. fimi protease (Gilkes et al.,
J. Biol. Chem., 263:10401-10407, 1988) or collagenase.

The Pro-Thr linker sequence also provides the 5'
expression control elements required to express a DNA
sequence inserted in the mcs of λfoo (Figure 2).
Within the Pro-Thr linker sequence are a ribosome
binding site and methionine start codon upstream of
the HindIII, BamHI, SacI, and EcoRI cloning sites.
Thus, a DNA sequence inserted into any of these sites
that is in the same reading frame as the start codon
methionine would be expressed from the ribosome
binding site and start codon contained within the Pro-
Thr linker sequence. Expression from this ribosome
binding site and start codon is not dependent on the
presence of a suppressor tRNA and thus occurs in both
su- and su+ hosts to produce a monomer of the protein
encoded by the inserted DNA.
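
The reading-frame requirement stated above amounts to a modulo-3 check on nucleotide positions. The following Python sketch is illustrative only: the function name and the example coordinates are hypothetical, since the text does not give the nucleotide positions of the start codon or of the cloning sites.

```python
def in_same_frame(start_codon_pos: int, insert_first_codon_pos: int) -> bool:
    """Return True if the insert's first codon is in frame with the start codon.

    Both arguments are 0-based positions of the first nucleotide of a codon
    on the same strand; the two are in frame when they differ by a
    multiple of three.
    """
    return (insert_first_codon_pos - start_codon_pos) % 3 == 0


# Hypothetical coordinates: an insert whose first codon begins 30 nt
# downstream of the ATG is in frame; one beginning 31 nt downstream is not.
in_frame_example = in_same_frame(100, 130)
out_of_frame_example = in_same_frame(100, 131)
```

An out-of-frame insert would still be translated from the linker start codon, but not as the intended protein.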

1) Expression of pV' and Wild-Type pV
Proteins from λfoo by Conditional
Suppression

Conditional suppression was used to
characterize expression of λfoo in several
E. coli strains. λfoo produced
smaller plaques than λV'sac and λV'mcs when plated
with EQ166 (su-) and Q358 (su+).

2. Methods of Producing a Functionally Active β-gal
Fusion Protein on the Surface of Lambda Phage
Particles

To express E. coli β-gal fused to the carboxy
terminus of pV and incorporate β-gal into the surface
of phage particles, the entire β-gal gene was inserted
into and expressed by the λfoo vector.

a. Construction of a Tricistronic Expression
Vector λβ-gal for Producing a Biologically
Active Multimeric Protein on the Surface of
Lambda Phage Particles

The tricistronic expression vector λβ-gal
was constructed by ligating the β-gal gene excised
from pMC1871 (Pharmacia, Piscataway, New Jersey) to
the left arm of λfoo and the right arm of λV'mcs. The
left arm of λfoo was used because it contains the Pro-
Thr linker sequence between the V gene, amber codon,
and SfiI site. The right arm of λV'mcs was used
because it does not contain the α-peptide of the β-gal
gene that provides a means for α-complementation. If
the right arm of a vector, such as λfoo, which
contains the α-peptide of the β-gal gene were used in
the construction of the λβ-gal vector, both the vector
without insert and the β-gal vector would have β-gal
activity and generate plaques with a blue color in the
presence of X-gal. Thus, it would not be possible to
distinguish between phage containing the β-gal insert
and those without the β-gal insert by the color
screening method described herein.

1) Insertion of β-gal Gene into λV'mcs

The β-gal gene was inserted into the
BamHI site of λfoo to generate λβ-gal. This was
performed by first digesting pMC1871 containing the
β-gal gene, λfoo, and λV'mcs with BamHI and then
ligating the fragments together with T4 DNA ligase.

2) Expression of pV/β-gal Fusion Protein
and Incorporation into Phage Particles
by Conditional Suppression

The expression of recombinant phage
containing the left arm of λfoo, the β-gal insert, and
the right arm of λV'mcs was characterized by
conditional suppression in several E. coli strains.
Growth in a su- host would express only the pV'
protein and the β-gal monomeric protein. The
pV'-β-gal fusion protein consisting of the pV', linker
peptide, and β-gal monomer would not be expressed.
The fusion protein would not be expressed because a
tRNA which can insert an amino acid at the amber codon
in the pV coding sequence is not present in a su-
host. The pV protein would therefore be truncated at
the amber codon present at amino acid residue 177 of
the pV protein. The monomeric β-gal protein would be
expressed from the ribosome binding site and
methionine start codon contained within the coding
sequence of the Pro-Thr linker. The monomeric β-gal
protein is encoded by the DNA insert in the λ vector.
Thus, although the β-gal monomers expressed from the
ribosome binding site in the Pro-Thr linker peptide
would assemble to form a β-gal multimeric complex with
enzymatic activity, the β-gal multimeric complex would
not be incorporated into phage particles by means of
the pV-β-gal fusion protein.

Growth in a su+ host would express the pV',
pV-β-gal fusion protein, and β-gal monomeric proteins from
the tricistronic message. The pV-β-gal fusion
protein, consisting of pV', the Pro-Thr linker
peptide, and monomeric β-gal would be incorporated
into the phage particle by means of the pV' matrix
anchor. Monomeric β-gal expressed from the ribosome
binding site and start codon in the Pro-Thr linker
sequence would also be expressed and assemble with the
pV-β-gal fusion protein to form an enzymatically active
β-gal on the surface of the phage particles.

The recombinant phage, λβ-gal, grown on E. coli
host EQ166 formed blue plaques on agar plates
containing the color indicator X-gal, indicating that
active β-gal had been produced. However, only β-gal
monomers were produced in this su- strain and β-gal
was not incorporated onto the surface of the phage
particles. Translation of the wild-type β-gal
monomers was initiated at the ribosomal binding site
in the linker peptide of the construct.

The recombinants were then grown on host strains
with varying amounts of suppressor activity to
determine whether functional phage particles were
produced. The phage grew well and thus produced high
numbers of phage particles in a liquid medium with
EQ166 (su-), CA168 (suB+) and MC8 (suG+). However, the
phage grew poorly on strains which produce higher
numbers of suppressor tRNAs such as TG1 (suII+), Q1
(suII+), EQ170 (suII+), and EQ82 (suII+ suIII+). On a solid
medium, λβ-gal made smaller plaques on TG1 (suII+) and
Q358 (suII+) than when grown in EQ166 (su-) and MC8
(suG+). These results suggest that the production of
large amounts of fusion protein (pV'-β-gal) by hosts
which produce higher numbers of suppressor tRNAs has
an adverse effect on phage particle assembly. The
frequency with which an amino acid residue is inserted
at the amber codon determines the ratio of pV'-β-gal
fusion protein, pV' protein, and β-gal protein
produced in the strain. If an amino acid residue is
inserted at the amber codon, a pV'-β-gal fusion
protein, pV' protein, and β-gal monomeric protein are
produced. If an amino acid residue is not inserted at
the amber codon, the pV' and β-gal proteins are
produced. The pV' protein is produced by initiation
of translation at the naturally-occurring methionine
at the start of the coding sequence. The pV' protein
is terminated at amino acid residue number 176 at the
position of the amber codon in the absence of a
suppressor tRNA. The β-gal monomer is produced by
initiation of translation at the methionine which is
encoded by the Pro-Thr linker sequence between the
nucleotide sequence encoding the pV' protein and the
inserted DNA encoding β-gal. Therefore, the E. coli
strains tested in this invention produce varying
ratios of the pV'-β-gal fusion protein, pV', and β-gal
monomers.
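
The expression outcomes described above for the two kinds of host can be summarized as a small lookup, sketched here in Python. The function names and product labels are illustrative, and the sketch treats suppression as simply present or absent rather than modeling the varying ratios the text describes.

```python
FUSION = "pV'-beta-gal fusion"
TRUNCATED_PV = "pV'"
MONOMER = "beta-gal monomer"


def expressed_products(host_has_amber_suppressor: bool) -> set[str]:
    """Products expected from the tricistronic message in a given host."""
    # pV' and the beta-gal monomer do not require a suppressor tRNA:
    # the monomer is translated from the start codon in the linker.
    products = {TRUNCATED_PV, MONOMER}
    if host_has_amber_suppressor:
        # Read-through of the amber codon yields the fusion protein.
        products.add(FUSION)
    return products


def can_display_beta_gal(host_has_amber_suppressor: bool) -> bool:
    """Display on the phage requires the pV'-anchored fusion protein."""
    return FUSION in expressed_products(host_has_amber_suppressor)


su_minus_products = expressed_products(False)
su_plus_products = expressed_products(True)
```

This captures why a su- host can show β-gal activity (from free monomers) without any β-gal being anchored to the phage particle.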

The recombinant phage, λβ-gal, grown on su+ E.
coli hosts formed blue plaques on agar plates
containing the color indicator X-gal, indicating that
active β-gal had been made. However, β-gal activity
does not conclusively demonstrate that β-gal was
incorporated onto the surface of the phage particles.
Phage particles were therefore purified to eliminate
β-gal proteins that were not incorporated into the
surface of phage particles.

3) Preparation of Phage Particles
Displaying β-gal Multimer

Phage particles grown on EQ166 (su-)
and MC8 (su+) were produced and purified for further
experiments which demonstrate that β-gal had been
incorporated onto the surface of the phage particles.
Growth in a su+ E. coli strain MC8 produced the
pV'-β-gal fusion protein, pV', and wild-type β-gal monomers.
These proteins were either assembled into the phage
particles to form functionally active pV'-β-gal on the
surface of the phage particles or were separate from
the phage particles in the culture. The purification
step was necessary to separate the phage particles
containing the functionally active pV'-β-gal fusion
protein from the remaining proteins in the culture.

One of skill in the art would realize that
wild-type β-gal would need to be provided in some manner in
order to form the functionally active β-gal tetrameric
complex and that the wild-type β-gal monomer may be
provided in cis or in trans. The conditional
suppression method of expression of this invention
is one example of providing the wild-type β-gal
monomer in cis.

4) Immunoblotting of Phage Particles
Displaying β-gal Multimer

The incorporation of the pV'-β-gal
fusion protein into phage particles was also detected
by analysis of proteins on a polyacrylamide-SDS gel
stained with PAGE blue 83 (BDH, Poole, England) and
immunoblotted with an anti-β-gal antibody. Phage
particles from the λfoo vector without insert and
λβ-gal were produced in MC8 (su+) and EQ166 (su-) as
described in Example 2a2 above.

Phage proteins were dissociated by boiling in a
sample buffer (63 mM tris-Cl, pH 6.8; 10% (v/v)
glycerol, 1% (w/v) SDS, 1% (v/v) β-mercaptoethanol,
0.0005% Bromophenol blue). Phage proteins were
further analyzed by standard methods as described in
Molecular Cloning: A Laboratory Manual, Second
Edition (Maniatis et al., eds., Cold Spring Harbor,
NY, 1989). Phage proteins were separated by 10%
SDS-polyacrylamide gel electrophoresis (PAGE). After
electrophoresis, viral proteins in the SDS-PAGE gel
were either stained with PAGE blue 83 or
electrophoretically transferred onto a nitrocellulose
membrane at 300 amps for 15 hours in a cold room to
generate a nitrocellulose blot. Nitrocellulose blots
of viral proteins were preblocked by washing four
times in a blocking buffer (1x phosphate-buffered
saline [PBS] containing 5% [w/v] nonfat dry milk, 0.1%
[v/v] Nonidet P-40, and 0.05% [w/v] sodium azide) at
room temperature for 30 minutes. The blots were then
incubated with a primary antibody, the mouse
monoclonal anti-β-gal, at 5 µg/ml (Promega, Madison,
WI) at room temperature for 30 minutes. After washing
three times with the blocking buffer, the filter was
incubated with a secondary antibody, goat anti-mouse
IgG, conjugated with alkaline phosphatase (Promega,
Madison, WI). After removing unbound antibodies by
washing as described above, the alkaline
phosphatase-conjugated antibody bound to the β-gal on the filter
was reacted with nitro blue tetrazolium (NBT) at a
concentration of 50 µg/ml in 70% dimethylformamide
(DMF) and 5-bromo-4-chloro-3-indolyl phosphate (BCIP)
at a concentration of 50 µg/ml in 70% DMF to form an
insoluble blue precipitate on the filter. The color
development was stopped by rinsing the filter twice in
deionized water.

Three phage proteins were produced with λβ-gal
and λfoo in EQ166 (su-) and MC8 (su+), respectively,
as visualized in the PAGE blue 83 stained gel (Figure
3). Four phage proteins were produced with λβ-gal in
the E. coli host MC8 (su+). Two of the four phage
proteins produced with λβ-gal in MC8 (su+) react with
the β-gal antibody. The two proteins represent the
fusion protein (pV'-β-gal) and the β-gal monomeric
proteins which were incorporated into the surface of
the bacteriophage particle. No proteins recognized by
the β-gal antibody were detected with either λβ-gal in
EQ166 or λfoo in MC8. λβ-gal expresses only β-gal
monomers in the su- strain EQ166 and does not express
the fusion protein (pV'-β-gal) which would be
incorporated onto the surface of the phage particle
and assemble with β-gal monomers. λfoo does not
contain a β-gal gene and therefore does not express
β-gal proteins.

5) β-gal Activity Assay with Phage
Particles Displaying β-gal Multimer
A comparison of β-gal activity
associated with phage particles expressed from λβ-gal
and λfoo in the EQ166 (su-) and MC8 (su+) E. coli
strains was made. Phage particles of λβ-gal and λfoo
grown on EQ166 (su-) and MC8 (su+) were purified by
ultracentrifugation as described in Example 2a3. β-gal
activity of the purified phage particles was measured
according to methods described by Miller (A Short Course in
Bacterial Genetics, Cold Spring Harbor Lab. Press,
Plainview, NY, 1992). λβ-gal phage particles prepared
from the culture grown on EQ166 (su-) had only
residual β-gal activity compared to that of λfoo.
However, λβ-gal phage particles prepared in MC8 (su+)
had substantial activity when compared to λfoo.
λβ-gal phage particles expressed in EQ166 (su-) had 3% of
the β-gal activity of phage particles expressed in MC8
(su+). These results indicate that both the pV'-β-gal
fusion protein and wild-type β-gal monomers were
expressed from λβ-gal in MC8 (su+). They also
indicate that the pV'-β-gal fusion protein was
incorporated into the phage particles and that the
wild-type β-gal monomers had assembled with pV'-β-gal
on the surface of the phage particles to form a
functionally active β-gal tetramer.

6) Electron Microscopy of Phage Particles
Displaying β-gal Multimer

The presence of β-gal multimers on the
surface of phage particles was visualized by electron
microscopy. β-gal phage particles were produced in
Q358 (su+) in liquid culture and purified by
centrifugation through a CsCl gradient as described in
Example 2a3. Phage were negatively stained with
uranyl acetate and visualized at a magnification of
60,200x. Among about 700 phages analyzed by electron
microscopy, derived from two separate phage
preparations, 55% of phage have no β-gal on the
surface, 41% have one β-gal molecule, 3% have two
β-gal molecules, and only 1% have three or more β-gal
molecules. These results indicate that more than two
β-gal molecules on a virion particle may have an
inhibitory effect on tail assembly, which would explain the
poor growth of fusion phage particles on hosts with
strong suppressor activity.
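One rough way to read these counts is to compute the mean number of
β-gal molecules per phage and to compare the observed classes with a
Poisson distribution that has the same fraction of unloaded phage. The
sketch below does this with the percentages quoted above. The Poisson
reference model and the function names are assumptions introduced for
illustration; they are not part of the experiments described.

```python
import math

# Observed fraction of phage with 0, 1, 2, or >=3 molecules (from the text).
# The ">=3" class is stored under key 3, so the mean is a lower bound.
OBSERVED = {0: 0.55, 1: 0.41, 2: 0.03, 3: 0.01}


def mean_copies(dist):
    """Lower-bound mean copies per phage (">=3" counted as exactly 3)."""
    return sum(k * p for k, p in dist.items())


def poisson_reference(p0, kmax=2):
    """Poisson classes 0..kmax plus a pooled '>kmax' class, matched on P(0)."""
    lam = -math.log(p0)
    probs = {k: math.exp(-lam) * lam ** k / math.factorial(k)
             for k in range(kmax + 1)}
    probs[kmax + 1] = 1.0 - sum(probs.values())
    return lam, probs


if __name__ == "__main__":
    print("mean copies per phage (lower bound):", round(mean_copies(OBSERVED), 2))
    lam, ref = poisson_reference(OBSERVED[0])
    for k in sorted(OBSERVED):
        print(k, "observed", OBSERVED[k], "Poisson reference", round(ref[k], 3))
```

Under this assumed reference model the classes with two or more
molecules are rarer than expected, which is in line with the inhibitory
effect suggested above.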

7) Affinity Selection of Phage Particles
Displaying β-gal Multimer
Affinity selection of phage particles
displaying the β-gal multimer by a modified
enzyme-linked immunosorbent assay (ELISA) was essentially as
described by Engvall et al., Immunochemistry, 8:871-879,
1971. The input ratio of phage particles
produced by λβ-gal and λV'sac was varied and the
output ratio determined by the color screening method
using X-gal as described in Example 2a2. Enrichment
of phage particles produced by λβ-gal over λV'sac was
determined by a comparison of the input to output
phage (Table 1).

Microtiter wells (Dynatech, Chantilly, VA) were
coated with 5 µg/ml mouse monoclonal anti-β-gal
antibody in coating buffer (0.1 M carbonate buffer, pH
9.6) at 4°C overnight. Unbound antibody was removed
by washing twice with a blocking buffer (1x phosphate
buffered saline (PBS) with 0.1% (v/v) Tween 20, 0.25%
(w/v) bovine serum albumin (BSA), 5% (w/v) skimmed
milk, and 0.1% (w/v) sodium azide). Nonspecific
binding sites in the wells were blocked with the
blocking buffer at room temperature for one hour, and
the wells were washed twice with λ-dil (10 mM tris-Cl,
pH 7.4, 5 mM MgCl2, 0.2 M NaCl, 0.1% (w/v) gelatin).
Then 50 µl of a mixture of λβ-gal and λV'sac phage in
λ-dil was added to the wells and incubated at room
temperature for one hour. Unbound phage were removed
by washing with a washing solution (1x PBS, 5% (w/v)
skimmed milk, and 0.5% (v/v) Tween 20) at 37°C for 30
minutes. Non-specifically bound phage were eluted
with elution buffer (10 mM tris-Cl, pH 7.4, 5 mM
MgSO4, 0.2 M NaCl, 10 mM CaCl2) at 37°C for 30
minutes. Specifically bound phage were eluted from
the wells with 20 units of collagenase in elution
buffer at 37°C for 30 minutes. Eluates were assayed
for infectivity of EQ166 by adding 0.1 ml of overnight
culture of the bacterium in CY medium (10 g casamino
acids, 5 g yeast extract, 3 g NaCl, 2 g KCl in 1
liter of distilled water, pH 7.0) supplemented with 5
mM MgSO4 and incubated at 37°C for 15 minutes. The
infected bacteria were plated as previously described
in Example 2a2 with X-gal. The numbers of blue and
colorless plaques representing λβ-gal and λV'sac,
respectively, were determined (Table 1). The
enrichment of phage produced by λβ-gal over λV'sac was
determined by comparing the numbers of input to output
phage particles.






Table 1

Input phage         Output phage
λβ-gal/λV'sac       λβ-gal/λV'sac       Enrichment

10^4/10^10          129/162             7.7 x 10^5
10^4/10^9           108/39              3.0 x 10^5
10^4/10^8           168/25              1.7 x 10^5
10^4/10^7           96/6                1.6 x 10^4

10^3/10^10          5/106               5.3 x 10^5
10^3/10^9           99/32               2.8 x 10^5
10^3/10^8           8/11                0.8 x 10^5
10^3/10^7           10/5                0.2 x 10^4
10^3/10^6           8/0                 >8.0 x 10^3






Microtiter plates coated with anti-β-gal
antibodies were used to demonstrate the efficiency
with which the fusion phage, λβ-gal, can be purified
by affinity chromatography. A mixture of λβ-gal and
λV'sac phage grown on MC8 (su+) in liquid culture was
subjected to a single round of affinity selection. As
shown above in Table 1, a 10^4- to 10^6-fold enrichment
of λβ-gal phage over the reference phage λV'sac
occurred, using various ratios of input phage. The
average efficiency of recovery of the λβ-gal phage was
about 1% of the input phage. These efficiencies of
enrichment and recovery with λfoo are comparable to
those using the filamentous phage vectors (Scott et
al., Science, 249:386-390, 1990 and McCafferty et al.,
Nature, 348:552-554, 1990).
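The comparison of input to output phage can be written out as a short
calculation. The sketch below assumes that enrichment is the output
ratio of target to reference phage divided by the input ratio, and that
recovery is the number of output target plaques divided by the number
of input target phage. These formulas and the function names are
assumptions for illustration; the simple quotient need not reproduce
every value tabulated in Table 1.

```python
import math


def enrichment(in_target, in_ref, out_target, out_ref):
    """Return (fold_enrichment, is_lower_bound).

    When no reference plaques are recovered, one reference plaque is
    assumed, so the result is only a lower bound.
    """
    input_ratio = in_target / in_ref
    is_lower_bound = out_ref == 0
    output_ratio = out_target / max(out_ref, 1)
    return output_ratio / input_ratio, is_lower_bound


def recovery(in_target, out_target):
    """Fraction of the input target phage recovered as plaques."""
    return out_target / in_target


if __name__ == "__main__":
    # Row with input 10^4/10^7 and output 96/6.
    fold, bound = enrichment(1e4, 1e7, 96, 6)
    print(f"enrichment: {fold:.1e}, lower bound: {bound}")
    print(f"recovery: {recovery(1e4, 96):.2%}")
    # Row with input 10^3/10^6 and output 8/0.
    fold, bound = enrichment(1e3, 1e6, 8, 0)
    print(f"enrichment: {fold:.1e}, lower bound: {bound}")
```

For the two rows used here the quotient agrees with the tabulated
enrichment of 1.6 x 10^4 and the lower bound of 8.0 x 10^3, and the
recovery is close to the 1% quoted above.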



3. Methods of Producing a Functionally Active BPA
Fusion Protein on the Surface of Lambda Phage
Particles

Another multimeric protein, BPA, was also
incorporated onto the surface of phage particles by
the same methods described in Example 2 for
incorporation of β-gal onto the surface of phage
particles by expression in the λfoo vector.

A. Insertion of BPA Gene into λfoo

The BPA gene was amplified from B. purpurea
lectin cDNA (Kusui, J. Biochem., 109:899-903, 1991) by
PCR with a pair of oligonucleotide primers
(5'-GTCTGCAGCACAAGCTCAACCTTA-3', SEQ ID NO 7 and
5'-GAGAATTCTTTACATACTGGAATAAGAG-3', SEQ ID NO 8). The
two primers were designed to obtain the cDNA encoding
a mature lectin protein lacking a leader sequence or
signal peptide and with a HindIII restriction site at
the 5' end and BamHI site at the 3' end. The PCR
fragment was directionally inserted into pUC18
(Pharmacia, Piscataway, New Jersey) which had been
digested with HindIII and BamHI. Vectors containing
inserts were identified as white colonies from
transformants of E. coli TG1 host (Δ(lac-proAB),
supE, thi, hsdD4/F', traD36, proAB, lacIq ZΔM15) on an
indicator plate containing the inducer
isopropyl-β-D-thiogalactopyranoside (IPTG) and the color
substrate X-gal. IPTG induces expression of the BPA gene from the
lacZ promoter in the pUC18 vector. X-gal forms a blue
color in the presence of β-gal as explained in Example
2a2. Thus, transformants which contain the BPA insert
form white colonies in the presence of IPTG and
X-gal. The presence of the BPA insert was confirmed
by DNA sequencing (Sanger et al., J. Mol. Biol.
162:729, 1982). The cDNA encoding BPA was
subsequently transferred into the λfoo vector by
directional insertion at the HindIII and BamHI cloning
sites and the construct was confirmed by restriction
mapping.
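A primer design of this kind can be checked by scanning each primer for
restriction enzyme recognition sequences. The sketch below uses the two
primer sequences as printed (SEQ ID NO 7 and SEQ ID NO 8) and the
well-known recognition sequences of four common enzymes; the dictionary
and function names are hypothetical. Run on the primers as printed, it
reports a PstI site in SEQ ID NO 7 and an EcoRI site in SEQ ID NO 8
rather than HindIII or BamHI sites, so the primer sequences and the
enzymes named in the text should be checked against the original filing.

```python
# Recognition sequences (all palindromic, so one strand is sufficient).
SITES = {
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "PstI": "CTGCAG",
    "EcoRI": "GAATTC",
}

PRIMER_SEQ_ID_7 = "GTCTGCAGCACAAGCTCAACCTTA"
PRIMER_SEQ_ID_8 = "GAGAATTCTTTACATACTGGAATAAGAG"


def sites_in(seq):
    """Map enzyme name to the 0-based position of its first site in seq."""
    seq = seq.upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    print("SEQ ID NO 7:", sites_in(PRIMER_SEQ_ID_7))
    print("SEQ ID NO 8:", sites_in(PRIMER_SEQ_ID_8))
```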



B. Expression of pV/BPA Fusion Protein and
Incorporation into Phage Particles by
Conditional Suppression
The expression of recombinant phage
containing the BPA insert in λfoo was characterized by
conditional suppression in several E. coli strains as
described in Example 2a2. Growth in a su- host would
express only the pV' protein and the BPA monomeric
protein. The pV'-BPA fusion protein consisting of the
pV', linker peptide, and BPA monomer would not be
expressed, because a tRNA which can insert an amino
acid at the amber codon in the pV coding sequence is
not present in a su- host. The pV protein would
therefore be truncated at the amber codon present at
amino acid residue 177 of the pV protein. The
monomeric BPA protein would be expressed from the
ribosome binding site and methionine start codon
contained within the coding sequence of the Pro-Thr
linker. The monomeric BPA protein is encoded by the
DNA insert in the λ vector. Thus, although the BPA
monomers expressed from the ribosome binding site in
the Pro-Thr linker peptide would assemble to form a
BPA multimeric complex with enzymatic activity, the
BPA multimeric complex would not be incorporated into
phage particles by means of the pV-BPA fusion protein.

Growth in a su+ host would express the pV',
pV-BPA fusion protein, and BPA monomeric proteins from
the tricistronic message. The pV-BPA fusion protein,
consisting of pV', the Pro-Thr linker peptide, and
monomeric BPA, would be incorporated into the phage
particle by means of the pV' matrix anchor. Monomeric
BPA expressed from the ribosome binding site and start
codon in the Pro-Thr linker sequence would also be
expressed and assemble with the pV-BPA fusion protein
to form an enzymatically active BPA on the surface of
the phage particles.

C. Preparation of Phage Particles Displaying 
BPA Multimer 

Phage particles grown on the MC8 (su+)
strain were produced and purified as described in
Example 2a2 for affinity selection experiments which
demonstrate that BPA had been incorporated onto the
surface of the phage particles. Growth in the su+ E.
coli strain MC8 produced the pV'-BPA fusion protein,
pV', and wild-type BPA monomers. These proteins were
either assembled into the phage particles to form
functionally active pV'-BPA on the surface of the
phage particles or were separate from the phage
particles in the culture. The purification step was
necessary to separate the phage particles containing
the functionally active pV'-BPA fusion protein from
the remaining proteins in the culture.

Phage particles grown on the MC8 (su+) strain
were produced and partially purified for competitive
ELISA experiments which demonstrate that binding of
BPA to mucin is competitively inhibited in the
presence of known antagonists. Growth in the su+ E.
coli strain MC8 produced the pV'-BPA fusion protein,
pV', and wild-type BPA monomers. These proteins were
either assembled into the phage particles to form
functionally active pV'-BPA on the surface of the
phage particles or were separate from the phage
particles in the culture. The purification step was
necessary to separate the phage particles containing
the functionally active pV'-BPA fusion protein from
the remaining proteins in the culture.

λBPA phage were grown on the MC8 (su+) strain in CY
medium and partially purified by precipitation with
polyethylene glycol (PEG) with a molecular weight
cutoff, Mr, of 8000 as described in Molecular Cloning:
A Laboratory Manual, Second Edition (Maniatis et al.,
eds., Cold Spring Harbor, NY, 1989).

D. Affinity Selection of Phage Particles
Displaying BPA Multimer

Phage particles which display the BPA
multimer were affinity selected in ELISA assays as
described in Example 2a7 for selection of phage
particles which display the β-gal multimer, with the
following modifications. Bovine submaxillary gland
mucin (Sigma, St. Louis, MO) was used as the ligand
and 1 mM CaCl2 was added to the binding and washing
buffers. CaCl2 was added to the buffers because BPA
binding to mucin is calcium-dependent. λBPA phages
were enriched 10^3- to 10^4-fold over the reference
λV'sac phage in the ELISA assays for affinity
selection with about a 0.01% recovery of input phage.
The lower enrichment and recovery may be due to the
lower association constant of BPA to the mucin
compared to that of the β-gal and its antibody.

E. Competitive ELISA to Determine Binding
Specificity of Phage Particles Displaying
BPA Multimer

Phage particles displaying BPA on their
surface as prepared in Example 2a2 were then analyzed
by competition ELISA to determine the specificity of
binding of BPA to various sugars. The competitive
ELISA was performed on microtiter wells coated with
mucin. Increasing concentrations of soluble or free
glucose, maltose, lactose, or galactose from 0.00 to
0.1 M were admixed with 10^10 phage particles displaying
BPA on their surface. After incubation and washes as
described in Example 2a7, bound phage particles were
eluted from the microtiter wells with collagenase as
described in Example 2a7. Data presented are the mean
of three separate experiments.

Lactose and galactose inhibited the binding of
λBPA to mucin by 50% at concentrations of 0.1 and 0.3
M, respectively. Maltose inhibited the binding of
λBPA to mucin by 50% at approximately 0.7 M. Glucose
inhibited the binding of λBPA to mucin by only 20% at
1.0 M. These results are consistent with evidence that
BPA binds specifically to β-D-galactose residues and
especially to Galβ1-3-N-acetylgalactosamine (Osawa et
al., Ann. Rev. Biochem., 56:21-42, 1987).

4. Methods of Producing a Library of Biologically
Active Fusion Polypeptides on the Surface of
Lambda Phage Particles

The method for producing and segregating a
desired biologically active multimeric complex can
be applied to the identification of a desired
biologically active multimeric complex from a
population of phage. Said population, or library, of
phage contains phage which each display a unique
biologically active multimeric complex in a surface
accessible manner and contain a genome which encodes
the multimeric complex.

The method by which the desired biologically
active multimeric complex is segregated from the
library and thus is identified has been described
herein.

Deposit of Materials 

The E. coli microorganism, MC8, and bacteriophage
lambda DNA, λfoo, were deposited on August 4, 1994,
with the American Type Culture Collection, 1301
Parklawn Drive, Rockville, MD, USA (ATCC). The
deposit provides a microorganism that is capable of
infection by, replication of, and expression of the
lambdoid expression vectors described in Example 1.
The bacteriophage lambda DNA, λfoo, is a lambdoid
expression vector of this invention and is described
in Example 1.

This deposit was made under the provisions of the
Budapest Treaty on the International Recognition of
the Deposit of Microorganisms for the Purpose of
Patent Procedure and the Regulations thereunder
(Budapest Treaty). This assures maintenance of a
viable microorganism and bacteriophage lambda DNA for
30 years from the date of deposit. The microorganism
and bacteriophage lambda DNA will be made available by
ATCC under the terms of the Budapest Treaty which
assures permanent and unrestricted availability of the
progeny of the culture and DNA to the public upon
issuance of the pertinent U.S. patent or upon laying
open to the public of any U.S. or foreign patent
application, whichever comes first, and assures
availability of the progeny to one determined by the
U.S. Commissioner of Patents and Trademarks to be
entitled thereto according to 35 U.S.C. §122 and the
Commissioner's rules pursuant thereto (including 37
CFR §1.14 with particular reference to 886 OG 638).
The assignee of the present application has agreed
that if the culture or DNA deposit should die or be
lost or destroyed when cultivated under suitable
conditions, it will be promptly replaced on
notification with a viable specimen of the same
culture or DNA, respectively. Availability of the
deposited strain and DNA is not to be construed as a
license to practice the invention in contravention of
the rights granted under the authority of any
government in accordance with its patent laws.

The foregoing is intended as illustrative of the 
present invention but not limiting. Numerous 
variations and modifications can be effected without 
departing from the true spirit and scope of the 
invention.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Maruyama, Ichiro 
Maruyama, Hiroko 
Brenner, Sydney 

(ii) TITLE OF INVENTION: LAMBDOID BACTERIOPHAGE VECTORS FOR 
EXPRESSION OF FOREIGN PROTEINS 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: The Scripps Research Institute, Office 

Patent Counsel 

(B) STREET: 10666 North Torrey Pines Road, TPC8 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US

(B) FILING DATE: 05-AUG-1994

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Fitting, Thomas 

(B) REGISTRATION NUMBER: 34,163 

(C) REFERENCE/DOCKET NUMBER: TSRI432.0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-554-2937 

(B) TELEFAX: 619-554-6312 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Pro Val Pro Asn Pro Thr Met Pro Val Lys Gly Ala Gly Thr Thr
1 5 10 15

Leu Trp Val Tyr Lys Gly Ser Gly Asp Pro Tyr Ala Asn Pro Leu Ser
20 25 30

Asp Val Asp Trp Ser Arg Leu Ala Lys Val Lys Asp Leu Thr Pro Gly
35 40 45

Glu Leu Thr Ala Glu Ser Tyr Asp Asp Ser Tyr Leu Asp Asp Glu Asp
50 55 60

Ala Asp Trp Thr Ala Thr Gly Gln Gly Gln Lys Ser Ala Gly Asp Thr
65 70 75 80

Ser Phe Thr Leu Ala Trp Met Pro Gly Glu Gln Gly Gln Gln Ala Leu
85 90 95

Leu Ala Trp Phe Asn Glu Gly Asp Thr Arg Ala Tyr Lys Ile Arg Phe
100 105 110

Pro Asn Gly Thr Val Asp Val Phe Arg Gly Trp Val Ser Ser Ile Gly
115 120 125

Lys Ala Val Thr Ala Lys Glu Val Ile Thr Arg Thr Val Lys Val Thr
130 135 140

Asn Val Gly Arg Pro Ser Met Ala Glu Asp Arg Ser Thr Val Thr Ala
145 150 155 160

Ala Thr Gly Met Thr Val Thr Pro Ala Ser Thr Ser Val Val Lys Gly
165 170 175

Gln Ser Thr Thr Leu Thr Val Ala Phe Gln Pro Glu Gly Val Thr Asp
180 185 190

Lys Ser Phe Arg Ala Val Ser Ala Asp Lys Thr Lys Ala Thr Val Ser
195 200 205

Val Ser Gly Met Thr Ile Thr Val Asn Gly Val Ala Ala Gly Lys Val
210 215 220

Asn Ile Pro Val Val Ser Gly Asn Gly Glu Phe Ala Ala Val Ala Glu
225 230 235 240

Ile Thr Val Thr Ala Ser
245

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
AGTGTGGAGC TCTACCCTTT C 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
CGCCTGTAAT AAGCGGCCGC AGCT 
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GCGGCCGCTT ATTACAGG
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16. .897 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AAATGTGAGG ACGCT ATG CCT GTA CCA AAT CCT ACA ATG CCG GTG AAA GGT 51
Met Pro Val Pro Asn Pro Thr Met Pro Val Lys Gly
1 5 10

GCC GGG ACC ACC CTG TGG GTT TAT AAG GGG AGC GGT GAC CCT TAC GCG 99
Ala Gly Thr Thr Leu Trp Val Tyr Lys Gly Ser Gly Asp Pro Tyr Ala
15 20 25

AAT CCG CTT TCA GAC GTT GAC TGG TCG CGT CTG GCA AAA GTT AAA GAC 147
Asn Pro Leu Ser Asp Val Asp Trp Ser Arg Leu Ala Lys Val Lys Asp
30 35 40

CTG ACG CCC GGC GAA CTG ACC GCT GAG TCC TAT GAC GAC AGC TAT CTC 195
Leu Thr Pro Gly Glu Leu Thr Ala Glu Ser Tyr Asp Asp Ser Tyr Leu
45 50 55 60

GAT GAT GAA GAT GCA GAC TGG ACT GCG ACC GGG CAG GGG CAG AAA TCT 243
Asp Asp Glu Asp Ala Asp Trp Thr Ala Thr Gly Gln Gly Gln Lys Ser
65 70 75

GCC GGA GAT ACC AGC TTC ACG CTG GCG TGG ATG CCC GGA GAG CAG GGG 291
Ala Gly Asp Thr Ser Phe Thr Leu Ala Trp Met Pro Gly Glu Gln Gly
80 85 90

CAG CAG GCG CTG CTG GCG TGG TTT AAT GAA GGC GAT ACC CGT GCC TAT 339
Gln Gln Ala Leu Leu Ala Trp Phe Asn Glu Gly Asp Thr Arg Ala Tyr
95 100 105

AAA ATC CGC TTC CCG AAC GGC ACG GTC GAT GTG TTC CGT GGC TGG GTC 387
Lys Ile Arg Phe Pro Asn Gly Thr Val Asp Val Phe Arg Gly Trp Val
110 115 120

AGC AGT ATC GGT AAG GCG GTG ACG GCG AAG GAA GTG ATC ACC CGC ACG 435
Ser Ser Ile Gly Lys Ala Val Thr Ala Lys Glu Val Ile Thr Arg Thr
125 130 135 140

GTG AAA GTC ACC AAT GTG GGA CGT CCG TCG ATG GCA GAA GAT CGC AGC 483
Val Lys Val Thr Asn Val Gly Arg Pro Ser Met Ala Glu Asp Arg Ser
145 150 155

ACG GTA ACA GCG GCA ACC GGC ATG ACC GTG ACG CCT GCC AGC ACC TCG 531
Thr Val Thr Ala Ala Thr Gly Met Thr Val Thr Pro Ala Ser Thr Ser
160 165 170

GTG GTG AAA GGG TAG AGC TGG CCT GTT AGG CCC ACT CCG ACC CCG ACC 579
Val Val Lys Gly * Ser Trp Pro Val Arg Pro Thr Pro Thr Pro Thr
175 180 185

ACT CCC ACC CCG ACT CCC ACC CCG ACC CCG ACC CCG ACT CCG ACC GTT 627
Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Val
190 195 200

GGG CCA ATT GTC ACA CAG GAA ACA GCT ATG ACC ATG ATT ACG CCA AGC 675
Gly Pro Ile Val Thr Gln Glu Thr Ala Met Thr Met Ile Thr Pro Ser
205 210 215 220

TTG CAT GCC TGC AGG TCG ACT CTA GAG GAT CCC CGG GTA CCG AGC TCG 723
Leu His Ala Cys Arg Ser Thr Leu Glu Asp Pro Arg Val Pro Ser Ser
225 230 235

AAT TCA CTG GCC GTC GTT TTA CAA CGT CGT GAC TGG GAA AAC CCT GGC 771
Asn Ser Leu Ala Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly
240 245 250

GTT ACC CAA CTT AAT CGC CTT GCA GCA CAT CCC CCT TTC GCC AGC TGG 819
Val Thr Gln Leu Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Trp
255 260 265

CGT AAT AGC GAA GAG GCC CGC ACC GAT CGC CCT TCC CAA CAG TTG CGC 867
Arg Asn Ser Glu Glu Ala Arg Thr Asp Arg Pro Ser Gln Gln Leu Arg
270 275 280

AGC CTG AAT GGC GAA TGG CGC CTG TAA TAAGCGGCCG CAGCTC 910
Ser Leu Asn Gly Glu Trp Arg Leu *
285 290



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Pro Val Pro Asn Pro Thr Met Pro Val Lys Gly Ala Gly Thr Thr
1 5 10 15

Leu Trp Val Tyr Lys Gly Ser Gly Asp Pro Tyr Ala Asn Pro Leu Ser
20 25 30

Asp Val Asp Trp Ser Arg Leu Ala Lys Val Lys Asp Leu Thr Pro Gly
35 40 45

Glu Leu Thr Ala Glu Ser Tyr Asp Asp Ser Tyr Leu Asp Asp Glu Asp
50 55 60

Ala Asp Trp Thr Ala Thr Gly Gln Gly Gln Lys Ser Ala Gly Asp Thr
65 70 75 80

Ser Phe Thr Leu Ala Trp Met Pro Gly Glu Gln Gly Gln Gln Ala Leu
85 90 95

Leu Ala Trp Phe Asn Glu Gly Asp Thr Arg Ala Tyr Lys Ile Arg Phe
100 105 110

Pro Asn Gly Thr Val Asp Val Phe Arg Gly Trp Val Ser Ser Ile Gly
115 120 125

Lys Ala Val Thr Ala Lys Glu Val Ile Thr Arg Thr Val Lys Val Thr
130 135 140

Asn Val Gly Arg Pro Ser Met Ala Glu Asp Arg Ser Thr Val Thr Ala
145 150 155 160

Ala Thr Gly Met Thr Val Thr Pro Ala Ser Thr Ser Val Val Lys Gly
165 170 175

* Ser Trp Pro Val Arg Pro Thr Pro Thr Pro Thr Thr Pro Thr Pro
180 185 190

Thr Pro Thr Pro Thr Pro Thr Pro Thr Pro Thr Val Gly Pro Ile Val
195 200 205

Thr Gln Glu Thr Ala Met Thr Met Ile Thr Pro Ser Leu His Ala Cys
210 215 220

Arg Ser Thr Leu Glu Asp Pro Arg Val Pro Ser Ser Asn Ser Leu Ala
225 230 235 240

Val Val Leu Gln Arg Arg Asp Trp Glu Asn Pro Gly Val Thr Gln Leu
245 250 255

Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Trp Arg Asn Ser Glu
260 265 270

Glu Ala Arg Thr Asp Arg Pro Ser Gln Gln Leu Arg Ser Leu Asn Gly
275 280 285

Glu Trp Arg Leu *
290



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GTCTGCAGCA CAAGCTCAAC CTTA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAGAATTCTT TACATACTGG AATAAGAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
Val Val Lys Gly
1



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGGTGAAAG GGTAGAGCTC CACACTG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAGTGTGGAG CTCTACCCTT TCACCAC 27 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Ser Ser Ser Leu Asp Pro Gly Pro Ser Thr Asn Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGGTAGAGCT CAAGCTTGGA TCCGGGCCCG TCGACGAATT C 41 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Trp Pro Val Gly Pro Ile Val Thr Gln Glu Thr Ala Met Thr Met 
1               5                   10                  15 

Ile Thr Pro Ser Leu His Ala Cys Arg Ser Thr Leu Glu Asp Pro Arg 
            20                  25                  30 

Val Pro Ser Ser Asn Ser 
        35 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGGTAGAGCT GGCCTGTTGG GCCAATTGTC ACACAGGAAA CAGCTATGAC CATGATTACG 60 
CCAAGCTTGC ATGCCTGCAG GTCGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTC 119 
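
The relationship between the nucleotide listings and the peptide listings above can be checked by translation. The sketch below is illustrative and is not part of the application as filed. It assumes the standard genetic code and reading frame 1, and it assumes that an amber (TAG) codon is read through in a suppressor host; the stop codon is marked with `*` rather than assigned a residue. The helper name `translate_readthrough` is hypothetical. A trailing two-base fragment is translated only when all four possible third bases give the same residue (for example, TCN is always Ser).

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_readthrough(dna: str) -> str:
    """Translate frame 1 without stopping; '*' marks a stop codon position."""
    dna = dna.replace(" ", "").upper()
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    tail = dna[len(dna) - len(dna) % 3:]
    if len(tail) == 2:
        # Emit a residue only if the third base cannot change it.
        options = {CODON_TABLE[tail + b] for b in BASES}
        if len(options) == 1:
            protein.append(options.pop())
    return "".join(protein)


# Sequences copied from the listing above, written 5' to 3'.
SEQ_ID_10 = "GTGGTGAAAG GGTAGAGCTC CACACTG"
SEQ_ID_13 = "GGGTAGAGCT CAAGCTTGGA TCCGGGCCCG TCGACGAATT C"
SEQ_ID_15 = (
    "GGGTAGAGCT GGCCTGTTGG GCCAATTGTC ACACAGGAAA CAGCTATGAC CATGATTACG "
    "CCAAGCTTGC ATGCCTGCAG GTCGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTC"
)

if __name__ == "__main__":
    for name, seq in (("10", SEQ_ID_10), ("13", SEQ_ID_13), ("15", SEQ_ID_15)):
        print(f"SEQ ID NO: {name} ->", translate_readthrough(seq))
```

Under these assumptions, SEQ ID NO: 10 begins with the residues of SEQ ID NO: 9 (Val Val Lys Gly) followed by the amber codon, the residues following the amber codon of SEQ ID NO: 13 match SEQ ID NO: 12, and those following the amber codon of SEQ ID NO: 15 match SEQ ID NO: 14.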
What is claimed is: 

1. A recombinant lambdoid bacteriophage vector 
comprising a nucleotide sequence that (i) defines the 
lambdoid elements for replication and packaging of the 
vector into an assembled bacteriophage, and (ii) 
encodes a conditionally suppressible cistron for 
expression of a tail protein and a fusion protein that 
comprises: 

a) a promoter for transcribing the cistron, 

b) a first upstream translatable sequence that 
encodes a lambdoid bacteriophage tail polypeptide, 

c) a first ribosome binding site to initiate 
translation of said upstream translatable sequence, 

d) a second translatable sequence operatively 
linked downstream to said first translatable sequence 
that (i) encodes a linker polypeptide in frame with 
said tail polypeptide and (ii) includes a sequence 
adapted for ligation of an insert polynucleotide that 
defines a third translatable sequence downstream from 
said second translatable sequence that encodes a 
preselected polypeptide, and 

e) a suppressor termination codon within said 
second translatable sequence that upon suppression 
results in read-through to form a fusion polypeptide 
consisting of said tail polypeptide, linker 
polypeptide and preselected polypeptide. 

2. The vector of claim 1 wherein said second 
translatable sequence further includes a nucleotide 
sequence that defines a second ribosome binding site 
to initiate translation of said third translatable 
sequence. 

3. The vector of claim 1 wherein said lambdoid 
bacteriophage tail polypeptide is selected from the 
group consisting of pJ, pV, pG, pM and pT. 

4. The vector of claim 1 wherein said lambdoid 
bacteriophage tail polypeptide is pV. 

5. The vector of claim 4 wherein said pV 
includes residues 1-176 of the amino acid residue 
sequence shown in SEQ ID NO 6, and conservative 
substitutions thereof. 

6. The vector of claim 1 wherein said 
suppressor termination codon is selected from the 
group consisting of the amber and opal codons. 

7. The vector of claim 1 wherein said linker 
polypeptide is from about 10 to about 100 amino acids 
in length. 

8. The vector of claim 1 wherein said linker 
polypeptide has an amino acid residue sequence from 178 
to 213 as shown in SEQ ID NO 6. 

9. The vector of claim 1 wherein said 
conditionally suppressible cistron has a nucleotide 
sequence shown in SEQ ID NO 5. 

10. The vector of claim 1 wherein said vector 
has a nucleotide sequence functionally similar to the 
sequence of λfoo having ATCC accession number . 



11. A recombinant lambdoid bacteriophage 
comprising a matrix of proteins encapsulating a 
lambdoid genome encoding a fusion protein, said matrix 
including said fusion protein surface accessible in 
said matrix, and said fusion protein consisting 
essentially of, in the direction of amino terminus to 
carboxy terminus, a lambdoid bacteriophage tail 
polypeptide, a linker polypeptide and a preselected 
polypeptide. 

12. The lambdoid bacteriophage of claim 11 
wherein said lambdoid bacteriophage tail polypeptide 
is selected from the group consisting of pJ, pV, pG, 
pM and pT. 

13. The lambdoid bacteriophage vector of claim 
11 wherein said lambdoid bacteriophage tail 
polypeptide is pV. 

14. The lambdoid bacteriophage of claim 13 
wherein said pV includes residues 1-176 of the amino 
acid residue sequence shown in SEQ ID NO 1, and 
conservative substitutions thereof. 

15. The lambdoid bacteriophage of claim 11 
wherein said preselected polypeptide defines a 
biologically active protein selected from the group 
consisting of an enzyme, a ligand and a receptor. 

16. The lambdoid bacteriophage of claim 11 
wherein said lambdoid genome further encodes a 
heterologous protein capable of forming a multimeric 
protein complex with said fusion protein in said 
matrix. 

17. The lambdoid bacteriophage of claim 11 
wherein said fusion protein is present as a multimeric 
protein. 

18. The lambdoid bacteriophage of claim 17 
wherein said multimeric protein is selected from the 
group consisting of beta-galactosidase and Bauhinia 
purpurea agglutinin. 

19. The lambdoid bacteriophage of claim 11 
wherein said linker polypeptide has an amino acid 
residue sequence from 178 to 213 as shown in SEQ ID NO 
6. 

20. The lambdoid bacteriophage of claim 11 
wherein said bacteriophage is detectably labeled. 

21. A fusion protein having an amino acid 
residue sequence that comprises, in the direction of 
amino terminus to carboxy terminus, a lambdoid 
bacteriophage tail polypeptide, a linker polypeptide 
and a preselected polypeptide defining a biological 
activity. 

22. A library of recombinant lambdoid 
bacteriophage particles wherein each particle contains 
a recombinant lambdoid bacteriophage vector according 
to claim 1. 

23. The library of claim 22 wherein said library 
contains at least 10^7 different species of said 
vector. 

24. A library of recombinant lambdoid 
bacteriophage particles wherein each particle 
comprises a matrix of proteins encapsulating a 
lambdoid genome, said matrix including a fusion 
protein according to claim 21 surface accessible in 
said matrix. 

25. A method for detecting the presence of a 
preselected target in a sample comprising the steps 
of: 

a) admixing a sample containing said 
preselected target with a recombinant lambdoid 
bacteriophage according to claim 15, wherein said 
preselected polypeptide defines a biologically active 
ligand or receptor able to bind said preselected 
target, under binding conditions sufficient for said 
target-binding bacteriophage to bind said target and 
form a target-ligand or receptor complex; 

b) detecting the presence of said complex, 
and thereby the presence of said preselected target. 

26. The method of claim 25 wherein said 
detecting comprises detecting the presence of said 
bacteriophage particles, and thereby the presence of 
said preselected target. 

27. A method for producing a recombinant 
lambdoid bacteriophage, comprising the steps of: 

a) infecting an E. coli host strain having 
a termination codon suppression phenotype with a 
recombinant lambdoid bacteriophage vector according to 
claim 1; and 

b) culturing said infected host strain 
under bacteriophage growth conditions to produce said 
recombinant lambdoid bacteriophage. 

28. The method of claim 27 wherein said E. coli 
host strain is selected from the group consisting of 
EQ166, CA168 and MC8. 

29. The method of claim 28 wherein said MC8 has 
the characteristics of ATCC accession number . 

ABSTRACT 

Lambdoid phage comprising a matrix of proteins 
encapsulating a genome encoding first and second 
polypeptides of an autogenously assembling receptor 
and a receptor comprised of the first and second 
polypeptides surface-integrated into the matrix via 
lambdoid phage tail protein matrix anchor domain fused 
to at least one of the polypeptides. 

PATENT APPLICATION DECLARATION AND POWER OF ATTORNEY 



I HEREBY DECLARE THAT: 

My residence, post office address, and citizenship are as stated next to my name 
in PART A of page 2 hereof. 

I believe I am the original, first, and sole inventor (if only one name is 
listed) or an original, first, and joint inventor (if plural names are listed) 
of the subject matter which is claimed and for which a patent is sought on the 

invention entitled LAMBDOID BACTERIOPHAGE VECTORS FOR EXPRESSION OF FOREIGN PROTEINS 

the specification of which: 

is attached hereto 

X was filed on August 5, 1994 as Application Serial No. 08/286,888 

and was amended on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above- 
identified specification, including the claims, as amended by any amendment 
referred to above. 

I acknowledge the duty to disclose information which is material to the 
examination of this application in accordance with Title 37, Code of Federal 
Regulations, Sec. 1.56(a). 

I hereby claim foreign priority benefits under Title 35, United States Code, 
Sec. 119 of any foreign application(s) for patent or inventor's certificate 
listed in PART B on page 2 hereof and have also identified in PART B on page 2 
hereof any foreign application for patent or inventor's certificate having a 
filing date before that of the application on which priority is claimed. 

I hereby claim the benefit under Title 35, United States Code, Sec. 120 of any 
United States application(s) listed in PART C on page 2 hereof and, insofar as 
the subject matter of each of the claims of this application is not disclosed 
in the prior United States application in the manner provided by the first 
paragraph of Title 35, United States Code, Sec. 112, I acknowledge the duty to 
disclose material information as defined in Title 37, Code of Federal 
Regulations, Sec. 1.56(a) which occurred between the filing date of the prior 
application and the national or PCT international filing date of this 
application. 

I hereby declare that all statements made herein of my knowledge are true and 
that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the application or any patent 
issued thereon. 

I hereby appoint the following as my attorneys or agents with full power of 
substitution to prosecute this application and transact all business in the 
United States Patent and Trademark Office connected therewith: 

Douglas A. Bingham Reg. No. 32,457 April C. Logan Reg. No. 33,950 

Thomas Fitting Reg. No. 34,163 Donald G. Lewis Reg. No. 28,636 

whose mailing address for this application is: 

THE SCRIPPS RESEARCH INSTITUTE 
10666 North Torrey Pines Road 
Mail Drop TPC 8 
La Jolla, California 92037 



See Page 2 attached, signed, and made a part hereof. 

PATENT APPLICATION DECLARATION AND POWER OF ATTORNEY 



PART A: Inventor Information And Signature 

Full name of SOLE or FIRST inventor: Ichiro Maruyama 
Citizenship: Japan 
Post Office Address: 4657 Da Vinci Street, San Diego, California 92130 
Residence (if different): 
Inventor's Signature: 
Date: 

Full name of SECOND joint inventor, if any: Hiroko Maruyama 
Citizenship: Japan 
Post Office Address: 4657 Da Vinci Street, San Diego, California 92130 
Residence (if different): 
Second Inventor's Signature: 
Date: [illegible] 

Full name of THIRD joint inventor, if any: Sydney Brenner 
Citizenship: British 
Post Office Address: 17-B St. Edwards Passage, Cambridge, CB2 3PJ, England 
Residence (if different): 
Third Inventor's Signature: 
Date: [illegible] 

Full name of FOURTH joint inventor, if any: 
Citizenship: 
Post Office Address: 
Residence (if different): 
Fourth Inventor's Signature: 
Date: 

Full name of FIFTH joint inventor, if any: 
Citizenship: 
Post Office Address: 
Residence (if different): 
Fifth Inventor's Signature: 
Date: 

PART B: Prior Foreign Application(s) 

Serial No.    Country    Day/Month/Year Filed    Priority Claimed (Yes / No) 

PART C: Claim For Benefit of Filing Date of Earlier U.S. Application(s) 

Serial No.    Filing Date    Status (Patented / Pending / Abandoned) 

See Page 1 to which this is attached and from which this Page 2 continues. 